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FOREWORD 

The  analysis  of  variance  is  a  commonly  used  statistical  method  by^which     f'^^ 
estimates  of  a  number  of  variances  are  made  and  by  which  the  significance  of   ^  ''  ! 
the  differences  between  these  estimates  is  determined.  More  recently  the  var-    4 
iances  have  been  used  as  the  basis  for  estimation  of  variance  components.   The   -  fe-^- 
arithmetic  procedures  involved  are  straightforward  when  the  data  are  based  on 
the  same  frequency  in  each  subclass. 

On  the  other  hand,  unequal  cell  frequencies  are  not  uncommon  in  some 
types  of  experimentation.  This  is  particularly  true  in  animal-breeding  ex- 
periments concerned  with  litter  size  and  sex.  In  addition,  animals  are  fre-        ■. 
quently  lost  d\iring  the  experimental  period.  As  a  consequence,  the  ideal  of       \ 
equal  numbers  in  each  subclass  or  classfication  is  seldom  attained. 

The  relation  of  least-squares  procedures  to  the  analvsis  of  variance  in 
recent  years  has  been  emphasizad  by  many  authors.  For  data  conforming  to  the 
conventional  experimental  designs,  short-cut  procedures  are  well  known  which 
make  the  least-squares  method  unnecessary.   However,  it  is  only  recently  that 
the  power  of  least-squares  methods  for  the  analysis  of  data  with  unequal  sub- 
class frequencies  has  been  recognized. 

The  method  of  fitting  constants  (least-squares  analysis)  is  presented  in 
this  article  as  related  to  (1)  the  one-way  classification,  (2)  one-way  class- 
ification with  regression  or  covariance,  (3)  two-way  classification  without 
interaction,  (h)  two-way  classification  with  interaction,  (5)  multiple  and 
nested  classifications,  and  (6)  specific  animal  crossbreeding  experiments, 

-T 

The  least-squares  methods  herein  discussed,  while  related  specifically 
to  animal  breeding  data,  are  equally  applicable  to  data  with  unequal  subclas{5 
frequencies  from  any  other  area  of  research. 

Copies  of  this  article  may  be  obtained  from  Biometrical  Services,  Agri- 
cultural Research  Service,  U.  S.  Department  of  Agricultiire,  Plant  Industry     ^  "^ 
Station,  Beltsville,  Maryland,  *  C 

E.  L,  LeClerg 
Director,  Biometrical  Services  < 

Agricultural  Research  Service 
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LEAST-SQUARES  ANALTfSIS  OF  DATA  WITH 
UNEQUAL  SUBaASS  NUMBERS 

Walter  R.  Harvey  \j 


INTRODUCTION 

Disproportionate  subclass  frequencies  always  cause  the  different  classes 
of  effects  to  be  non-orthogonal.  This  means  that  the  different  kinds  of 
effects,  such  as  years,  sex,  and  age  of  dam,  cannot  be  separated  directly 
without  entanglement.  In  order  to  free  these  effects  from  the  entanglement 
or  confounding,  it  is  necessary  to  resort  to  simultaneous  consideration  of  all 
effects.  With  equal  subclass  frequencies  the  effects  and  sums  of  squares  for 
tests  of  significance  are  obtained  directly  from  the  class  or  subclass  totals 
which  is  equivalent  to  the  simultaneous  consideration  of  all  effects.  In 
this  case  the  design  is  said  to  be  balanced  and  the  effects  are  all  mutually 
orthogonal* 

All  statistics  which  can  be  computed  from  data  with  equal  subclass  fre- 
quencies can  also  be  computed  from  data  with  unequal  subclass  frequencies  by 
use  of  appropriate  statistical  methods  which  are  presently  available.  For 
example,  with  fixed  effects  the  investigator  is  interested  in  the  computation 
of  means,  regression  coefficients,  standard  errors,  tests  of  significance, 
orthogonal  comparisons,  interactions  and  mean  separation  procedures;  whereas, 
with  random  effects  one  is  interested  in  the  estimation  of  variance  components 
andisests  of  significance.  Ifethods  of  computing  these  statistics  from  data 
with  equal  subclass  frequencies  are  well  known  and  are  given  in  most  standard 
textbooks  for  different  designs.  It  is  the  purpose  of  this  presentation  to 
give  presently  available  procedures  for  confuting  all  these  statistics  from 
data  with  unequal  subclass  numbers  in  considerable  detail.  The  methods  are 
presented  by  beginning  with  the  simplest  models  and  progressing  to  the  nwre 
complex  models  that  are  so  often  encountered  in  the  analysis  of  animal- 
breeding  data. 


i/  Biometrician,  In  charge.  Livestock  Research  Staff,  Bioroetrical 
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LEAST-SQUARES  PRINCE PLES  IN  THE  ONE-WAY  aASSIFI CATION 

Methods  of  analjrzing  data  classified  in  only  one-way  are  straight-        / 
forward  even  though  unequal  nunibers  exist  from  class-to-class.  It  is  in- 
tended to  demonstrate  here  that  analysis  by  the  method  of  fitting  constants, 
in  which  direct  matrix  arithmetic  is  involved,  yields  the  same  results  as  the 
standard  methods  of  computations,  thereby  clarifying  the  meaning  of  least- 
squares  analysis,  \ 


/ 


Xi 


Mjdel 

In  any  analysis  it  is  important  that  the  mathematical  model  underlying 
the  analysis  and  the  assumptions  made  in  using  the  model  be  well  known.  In 
the  one-way  classification  the  modal  is: 

i  -  1,  2,  ...,  p 

^  -  1,  2,  ...,  ni, 

where : 

yi1  ■  ^^®  i^^   observation  in  the  i*"  A  class.   (The  A  classes  may 
represent  ages,  or  years,  or  sires,  or  treatments,  etc.), 

\L   ■  the  overall  population  mean  when  equal  frequencies  exist 
among  the  classes  of  A, 

Si^   ■  the  effect  of  the  i"^^  A  class  expressed  as  a  deviation  from 
the  overall  mean  pi, 

eij  ■  random  errors  which  are  assumed  to  be  independent.  If  tests 
of  significance  are  to  be  made  then  it  is  essential  to  assume 
that  the  e±A   are  also  normally  distributed,  i,e.,  N(0,a|), 

Each  of  the  terms  on  the  right  hand  side  of  the  equation  for  the  model 
are  population  parameters  or  constants.  Estimates  of  these  parameters  ob- 
tained in  the  analysis  are  designated  by  placing  a  circumflex  (^ )  over  the 
letter,  e,g,,  |x  is  the  estimate  of  \i,   a^  is  the  estimate  of  n.^   and  Oq   is  the 
estimate  of  a|.  If  the  a^  are  random  effects  then  the  variance  due  to  the 
^±9  ^a*   is  estimated  by  cr  . 

Least-Squares  Equations 

The  least-squares  normal  equations  result  from  the  use  of  a  differential 
calculus  principle.  However,  is  is  unnecessary  to  actually  go  through  the 
differential  calculus  manipulations  in  order  to  write  out  the  equations, 
since  they  follow  a  systematic  pattern.  The  first  point  to  remember  in  the 
construction  of  this. set  of  equations  is  that  there  must  be  one  equation  for 
each  of  the  constants  to  be  estimated.  The  equations  for  the  one-way 
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classification  are  as  follows: 


^l:  n.  ji  4-  n-j^a^  +  n2a2  +^npap  -  T. 
a-i :  n-j  p,  +  "i^i  "  ^1 

82:  n2  ^       +  n2a2       ■  12 

In  this  notation,  the  letter  and  appropriate  subscript  on  the  left  followed  by 
a  colon  denotes  the  equation.  The  second  subscript  is  omitted  in  the  n*s  and 
in  the  Y's  since  this  subscript  is  always  suinined  over.  However,  a  dot  in  the 
subscript  indicates  that  summation  has  been  made  over  that  subscript,  e.g., 
n.  ■  2  ni  where  n-,  ■  the  number  of  observations  in  the  it-h  class.  The  colxiron 

i 
of  values  on  the  right  hand  side  of  the  equal  sign  is  referred  to  as  the 
right  hand  member  (RHM).  Very  often  the  same  variance-covariance  matrix  is 
used  with  several  right  hand  members. 

The  least-squares  equations  may  be  represented  in  tabular  form  as 
follows : 


^1 

h 

- 

- 

- 

^P 

RHM 

V^i 

n. 

"1 

"2 

- 

- 

- 

^ 

T. 

^1' 

^1 

"1 

0 

- 

- 

- 

0 

^1 

a2: 

n2 

0 

ng 

- 

- 

- 

0 

^2 

apt   npO   0 np 


^P 


r 


Four  features  of  least-squares  equations  should  be  noted  from  this 
table.  First,  the  elements  of  the  variance-covariance  matrix  (the  left  hand 
members)  form  a  matrix  symmetrical  about  the  main  diagonal,  i.e.,  the  ele- 
ments on  the  right  hand  side  of  the  diagonal  which  runs  from  left  to  right 
are  a  mirror  image  of  those  on  the  left  side  of  these  diagonal  elements.  If 
the  rows  and  columns  are  numbered  1,  2,  ...p  +  l  and  the  first  number  is 
used  to  identify  a  given  row  and  the  second  number  to  identify  a  column,  then 
the  clement  in  the  12  position  is  the  same  as  the  element  in  the  21  position, 
the  element  in  the  23  position  is  the  same  as  the  element  in  the  32  position, 
etc.  For  the  problem  under  consideration,  elements  in  the  p,  equation,  as  the 
coefficients  for  the  a^^,  are  the  same  elements  as  the  diagonal  elements  in 
the  ai  equations..  Thirdly,  the  off  diagonal  elements  in  the  section  of  rows 
and  columns  for  the  ai  are  all  zero.  The  fourth  feature  'that  shoxild  be  noted 
is  that  the  sum  of  the  coefficients  for  the  a^  in  the  \i   equation  is  equal  to 
the  coefficient  for  p.  in  the  same  equation.  Likewise,  the  sum  of  the  right 
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hand  members  for  the  a^  equations  is  equal  to  the  right  hand  member  for  the  \l 
equation.  All  these  features  are  characteristic  of  original  least-squares 
liquations  in  general,  where  constants  are  fitted  for  classifications. 

The  least-squares  equations  for  the  one-way  classification  may  be  shown 
in  a  more  reduced  form  as  follows: 

u.:  n,  ft  +  Z  m  ai  "  T, 

\ 

a^:  n^  VI  +  n^  ai  -  Yi  -   j 

In  tabular  form,  these  may  be  represented  as  follows: 

Vi  &±         RHM 


\it       n,    n^     Y, 


where  the  zeros  in  the  SixfBL^   section  indicate  that  the  off -diagonal  elements 
in  this  section  are  equal  to  zero, 

Inposing  Restrictions 

As  noted  above,  the  sum  of  the  coefficients  for  the  a^  equals  the  co- 
efficient for  (i.  in  the  p.  equation  and  the  sum  of  the  RHM's  for  the  a^  equations 
equals  the  RHM  for  the  \i   equation.  Since  this  is  true  one  cannot  solve  the 
equations  until  some  restriction  (or  constraint)  has  been  imposed  on  the  con- 
stants. Actually  there  are  only  p  independent  equations  so  that  an  additional 
relationship  must  be  introduced,  generally  known  as  a  constraint  or  re- 
striction, in  order  to  obtain  a  solution.  There  are  an  infinite  nunber  of 
way^  in  ii^ch  this  can  be  done  from  a  mathematical  viewpoint.  However,  only 
two  of  these  are  in  common  use  today  and  they  will  be  considered  here, 

A  direct  method  of  obtaining  estimates  of  the  ai  as  deviations^ from  \i 
is  to  iapose  the  restriction  on  the  least-squares  equations  that  Z  a^^  ■  0» 

When  this  is  done  the  coefficients  of  one  of  the  a^,  say  aL,  must  be  sub- 
tracted from  the  coefficients  of  the  other  ai.  The  resulting  elements  in  the       *■ 
ap  equation  are  then  subtracted  from  the  corresponding  elements  in  the  other       .  ! 
B^   equations  to  maintain  symmetry  in  the  equations.  In  addition,  the  RHM  for       ^-A 
the  ftp  equation  is  subtracted  from  the  RHM^s  for  the  other  ai  equations,  Whea 
these  subtractions  are  completed  the  column  and  row  for  the  ap  equation  is 
deleted  and  the  resulting  syminetrical  set  of  equations  are  solved  to  obtain 
estimates  of  p,  and  the  ai  directly.  After  subtraction  p  equations  remain, 
i.e,,  p-1  for  the  ai's  and  one  for  \l» 

^        One  of  the  simplest  restrictions  (or  constraints)  one  could  impose  on  the 
ai  is  to  assume  that  one  of  the  ai  »  0,  say  ap,  and  simply  delete  the  ap 

e<}uation  and  the  column  of  coefficients  for  ap.  When  this  is  done  and  the 
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remaining  equations  are  solved  to  obtain  estimates  of  the  iinknowns  the 
following  linear  fiinctions  of  the  parameters  are  estimated: 


jii  B  ^  +  ftp 

ai  '  a-^  ^   ap 

^         (^        A 


a'  -  0 


In  the  model  the  aj_  are  expressed  as  deviations  from  p..  Hence,  it  is  logical 
to  assume  that  Z  a^  ■  0  and  obtain  the  constant  estimates  as  deviations  from 

ft.  Of  course,  the  magnitude  and  sign  of  the  differences  among  the  a'j^  and 
among  the  a^^  are  the  same.  After  obtaining  estimates  of  the  a^  one  can  com- 
pute estimates  of  the  ai  by  forcing  the  sum  of  the  aj^  to  equal  zero  since 

"2  Sj^/p  =  ap.  The  overall  mean,  ji,  can  then  be  separated  from  a-  and  the 

i«l 

least  squares  means,  \i   +  a^,  can  be  computed. 

The  restriction  that  the  sum  of  the  constant  estimates  vrithin  a  given 
set  sum  to  zero  is  preferred  to  the  restriction  that  one  of  the  constants  for 
each  set  is  equal  to  zero.  In  addition  to  the  advantage  of  obtaining  the  de- 
sired constant  estimates  directly  there  are  other  important  reasons  for  this 
preference  that  will  be  explained  later. 

Utilization  of  the  >^trix  Inverse 

Numerous  methods  exist»for  obtaining  the  solution  of  a  set  of  simul- 
taneous equations  and  for  obtaining  the  inverse  of  a  matrix.  None  of  these 
will  be  discussed  here.  Instead,  the  use  to  be  made  of  the  inverse  elements 
in  analysis  of  data  will  be  discussed  in  considerable  detail.  Some  of  the 
uses  made  of  the  inverse  elements  are  discussed  below. 

Computing  Estimates  of  the  Constants 

Although  the  constant  estimates  may  be  obtained  by  direct  solution  of 
the  equations,  often  it  is  more  convenient  to  obtain  them  from  the  inverse 
elements  and  the  RHM's  of  the  equations,  since 

2  Cij  Y.  -  a. 

where  C^J  is  the  inverse  element  for  the  ±^^   row  and  J"^^*  column  of  the  com- 
plete inverse  matrix,  Tj  is  the  RHM  for  the  J"t^  row  and  ci  is  the  ith 

-  5- 


w    .i..,«^--.«  nnlw      n<<.nro(luci(on  Ol  This  Horn  Mav  ConatitutA  Caavriaht  Infrlnn«m«nl 


constant  estimate.  For  example, 

a  -  CllY.  +  C12(Y^-Yp)  ^  Cl3(Y2.Yp)  +  ...  +  ClP(Yp__^-yp) 

"a^^  .  c21y.  -h  c22(i^.t^)  +  c23(T2.Yp)  +  ...  +  c2P(Yp_^-Yp) 
etc. 

If  a  method  of  inversion  and  solution  is  used  so  that  both  the  inverse 
elements  and  the  constant  estimates  are  obtained  directly,  then  an  indirect 
check  on  the  accumulation  of  rounding  errors  is  available  by  also  computing 
the  constant  estimates  by  this  method  and  checking  them  against  those  ob- 
tained directly. 


» t 


Standard  Errors  of  Constant  Estimates  or  Linear  Functions  of  Constant  Estimates 
The  standard  error  of  an  estimated  constant,  such  as  p.  or  a^,  is 

s^i  -  V^Cii  oi 

where  C^i  is  the  corresponding  diagonal  inverse  element  for  that  constant  and 

oi   ■  — =—  (22  Jii  -  R(M'>ai)/  for  the  one-way  classification.  The  reduction 
n.-p  l±i       -^  J 

due  to  fitting  all  constants,  .R(p.,aj^),  may  be  computed  from  all  constant  es- 
timates and  the  original  RHM's,  i.e., 

R(M.,ai)  «  ii  Y,  +  aiYi  +  a2Y2  +  ...  +  SpYp 

when  the  restriction  is  imposed  that  2  aj^  "  0,  since  ap  "  ^j  Si,  The  re- 

i  i-1 

duction  in  sum  of  squares  due  to  fitting  all  constants  may  be  computed  more 
easily  from  the  constant  estimates  and  the  reduced  RHM's,  i.e. 


R 


(^i,ai)  -  Jl  Y,  +'ai(Yi-Yp)  +  ,,,  +  ^.i(^_i-V 


The  reduction  in  sum  of  squares  due  to  fitting  all  constants  is  the  sarae 
when  the  ap  equation  is  deleted  by  row  and  column,  i,e., 

R(jx,ai)  -  JL'Y.  +  ajY^  +  ajr^  *   ...  +  ap-1  ^1 

However,  when  interactions  are  included  in  the  model  difficulties  arise  in 
imposing  the  restriction  that  certain  effects  equal  zero.  These  difficulties 
are  discussed  in  the  section  on  the  two-way  classification  with  interaction. 

The  standard  error  of  a  difference  between  two  constant  estimates  is 

s^  g  -  V  (Cii  -^  CJJ  -  2CiJ)  oi 


I' 

u 

N 
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The  same  standard  error  for  a  difference  between  two  constants  is  obtained  re- 
gardless of  the  restriction  iinposed  on  the  constant  estimates.  When  the  re- 
striction was  imposed  that  a^  ■  0,  the  inverse  elements  for  the  ap  colunm  and 
row  all  equal  zero,  but  when  the  restriction  was  imposed  that  2  aj[  ■  0  the  in- 
verse elements  for  the  ap-colunm  and-row  are  computed  from  the  dependencies 
which  cause  the  inverse  elements  within  set  to  sum  to  zero  by  rows  and 
coliunns.  For  example, 

ci(p+i)  --<ci2  +  0I3  +  cP-U  +  ...  +  cP-P) 
C2(p+1)  -  .(c22  +  c23  +  C2U  +  ...  +  c2p) 

mm 

C(p+l)(p+l)  --(Gl(P^l)  +  C2(p-^1)  +  C3(p+1)  +  ...  +  (CP(P+1)) 
The  standard  errors  of  the  least-squares  means  are  obtained  as  follows: 


cd+^i  -  V  (Cll  +  Cti  +  2Cli)  0% 

Regardlei5s  of  the  restriction  imposed,  this  standard  error  will  be  the  same, 
but  only  in  the  case  of  the  one-way  classification.  With  multiple  classi- 
fications, this  standard  error  is  difficult  to  obtain  if  a  coltunn  and  row  are 
deleted  in  each  set  of  constants.  With  the  restriction  that  the  sum  of  the 
constant  estimates  sum  to  zero  within  each  set,  the  least-squares  means  and 
their  standard  errors  are  easy  to  compute  from  the  constants  and  the  inverse 
elements* 


Confuting  Sums  of  Squares  for  the  Analysis  of  Variance 

The  sum  of  squares  for  A  is  easily  obtained  directly  from  the  class  num- 
bers and  class  totals  in  the  one-way  classification.  However,  this  same  sum 
of  squares  may  be  obtained  in  the  more  general  way  as  shown  below; 

S.Sqs.  -  B«  Z-1  B 

where  B'  is  a  row  vector  of  the  constant  estimates  for  a  given  set  (such  as 
the  a^);  Z-1  is  the  inverse  of  the  segment  of  the  inverse  of  the  variance- 
covariance  matrix  corresponding,  by  row  and  column,  to  this  set  of  constants; 
and  B  is  a  column  vector  of  the  set  of  constants.  The  sum  of  squares  ob- 
tained in  this  manner  is  equal  to  the  reduction  in  sum  of  squares  due  to 
fitting  all  constants  minus  the  reduction  in  sura  of  squares  due  to  fitting  all 
constants  except  the  set  being  considered.  This  sura  of  squares  may  be  obtained 
in  this  manner  regardless  of  which  of  the  two  restrictions  have  been  imposed 
on  the  constants.  Obviously,  the  addition  of  the  extra  row  and  column  is  un- 
necessary if  ap  has  been  assumed  to  equal  zero*  The  last  row  and  column  is 

also  unnecessary  if  the  restriction  was  imposed  that  Z  aj^  ■  0« 
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Numerical  Example 

Suppose  the  data  given  in  the  table  below  represent  gains  of  individual 
barrows  that  were  randomly  assigned  to  three  different  rations. 


Ration 

No, 

Pig  No. 

1 

2 

3 

1 

3 

5 

7 

2 

5 

6 

6 

3 

6 

2 

U 

I 

2 

7 

3 

$ 

8 

6 

6 

3 

li 

7 

9 

8 

8 

Yi 

16 

h8 

30 

^i 

U 

6 

5 

Con5)leting  the  analysis  of  these  data  by  least-squares  procedures  requires 
the  following  steps: 

1)  Ma thematic  model: 

y^j  -  |i  +  r^  +  eij 
where : 

y^j  •  the  gain  of  the  j't^  barrow  on  the  i'th  ration. 
(1  ■  overall  mean  with  equal  numbers* 
r^  -  effect  of  the  i"^^  ration. 

e^j  ■  random  error  assumed  to  be  independent  and  normally 
distributed,  i.e.,  NID(0,ae). 

2)  Least-squares  equations: 


p     rn      To      To        RHM  /?•'•■ 


\l^ 

18 

U 

8 

6 

9U 

n« 

I 

U 

0 

0 

16 

^2- 

8 

0 

8 

0 

18 

^3^ 

6 

0 

0 

6 

30 

,s 


3)  Imposing  the  restriction  that  2  r^  -  0: 
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The  reduced  least-squares  equations  are  as  follows: 


^    A 


{i  r^  r2   RHM 


^l:  18  -2  2  9li 
r]_;  -2  10  6  -Ih 
rj:    2   6  lU    18 


U)    Inversion  of  the  variance-covariance  matrix  and  computation  of  the  con- 
stants : 

'  '       .023118     -.018^9^ 

aa3^9     -.0618151 (1) 

.018519     -.O6I16I5       .101852 


/18     -2       2\  /. 060185 

-2     10      6         -       .023IU8 

\2      6    IhJ  V. 018519 


f  -  (.060l85)(9li)  +  (.0231h8)(-lli)  -  (.0l85l9)(l8)  -  5 
ri  -  (.0231h8)(9U)  +  (.Ih35l9)(-lli)  -  (.06h^l5)(l8)  -  -1 
rg  -  (-.018519) (9U)  -  (.06h8l5)(-lli)  +  (.10l852)(l8)  -  1 
r^  -  -(ci  +  r2)  "  0. 

5)     Setting  up  the  analysis  of  variance: 

Error  S.  Sqs  -  2)2  yf*  -  R(pL,ri) 

-  568  -  (5)(9h)  -  (-l)(-lli)  -  (1)(18) 

-  568  -  502  «  66, 

Ration  S.Sqs.   »  B»   Z-^  B 

-Ft     1  If  '1^3519    -.06b8l5f"      F-ll 
L         JL-oSHBB      ,10185a         [  ij 


r  1    ilfiiZIZi       6.2222]    F-l] 
L~         J [6.2222       I3.7778J    I  ij 

[-3.5556    7.5556]    [-jj 


-    11.1112. 

The  analysis  of  variance  is  as  follows: 

Source  of  Variation  d.f.   S.Sqs.   M.S.     F 

Rations        2   11,1112  $.S$S6     1.263  n.s. 
Error         15.  66.0000  h.UOOO 
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It  is  easily  verified  that  the  sum  of  squares  for  rations,  11.1112,  is 
equal  to 

(16)^  +  Mt  ^  i29r     (91)^ 

li      8      6     18 

and  that  the  error  sum  of  squares  is  equal  to 

22  yf .  -  2  Sii  -  568  -  502  -  66 
ij  ^^   i  "i. 

Hence,  in  the  one-way  classification  the  between  group  uncorrected  sum  of 

squares,  2  ^*  equals  R(n,  a^),  even  though  the  number  of  observations 

i  '^. 
vari  s  from  group-to-group.  Also,  in  the  one-way  classification  \i   and  the  r^ 
caoi  be  obtained  directly  from  the  class  or  group  means  as  follows: 

1  Y^   T2        Yp   1   - 

,  ±  (h  +  6  +  5)  «  5  in  this  example,  and 

Ti  -  71  -  JA. 

Hence,  in  the  present  exanple, 

ri  -  h  .  5  -  -1 
r2>  ■  6  -  5  ■  1,  and 
r3  -  5  -  5  -  0  . 

6)  Standard  errors  and  individual  comparisons: 

(a)  The  standard  error  of  an  effect,  such  as  the  t±,   has  little  sig- 
nificance itself  since  it  cannot  exist  without  the  mean.  The 
standard  errors  will  be  computed  for  the  least-squares  means,  jl 
and  the  |x  +  Tj^, 

e^  -  y/  (. 060185) (h.U)  -  .51 
e^^^   -  Vjr;060l85  +  .113519  +  (2)(.0231h8)J  (h.Ij) 


-  y/  (.25)(i».l) 

-  1.05 
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»^I■^^2  -  >/(}060l85  +  .101852  +  (2)(-. 018519)]  (iuli)  | 

-  V(.l25)(U.li) 


lliiil'rr-ag;^-,.  -^fffj^-j -t,  iff  ,    i  -jjlii ^1^  llJtllllTrWTTt 


-   II      I  -    -■    "M." 1  -"■■■-  "'I      ■'■'  '■  •'  ■-■AA..^......b».».»~—      ■-- ■■^.  J 
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-  .71 
s^+j:     -  \/ [.060185  +  .II57I1I  ^  (2)(-.OOli629)J  (h.U) 


-  V  (. 166668) (U.h) 

-  »86 


The  inverse  elements  for  the  coefficients  in  the  least-squares  equation 
that  were  subtracted  out  above  are  necessary  to  obtain  the  standard  error 
for  jl  +  To.  These  were  obtained  as  follows: 

.II57UI  «  .II43519  +  .101852  +  (2)(-.06b8l5) 

and       -.OOI4629  -  -(.0231h8  -  .018519). 

Of  coiirse,  in  the  one-way  classification  it  is  unnecessary  to  go 
through  all  this  manipulation  to  obtain  these  standard  errors.  They  may 
be  obtained  directly  from  the  well  known  formula, 

where  s  is  the  estimate  of  the  error  variance,  a^,  and  n  is  the  number 

in  the  i"^^  group.  In  order  to  show  that  the  two  methods  give  identical 
results,  they  are  computed  directly  below: 


s'^.r^   -  V^  -  l.OS 


'S-rj-^/^-  •7'' 


^+?3  "  V^-  '86 

(b)  Individual  comparisons  among  the  r^ . 

The  well  known  "t"  test  can  be  employed  to  comroare  any  two  of  the 
Ti  effects  if  desired,  but  becomes  less  satisfactory  as  the  number 

of  effects  increase.  The  ratio,  distributed  as  t,  is 


t  - 


Spi-rj 


Tests  of  significance  among  all  pairs  of  the  Tj  in  this  example 
using  the  "t"  test  are  given  below: 

(i)  ?i  .  ?2  -  -2 


•il- 
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\ 


-2 


(ii)  h  - 


t  -  v/fl^'3^9  +  .101852  -  (2)(-.06h8l5)j  (li.l) 

-  \/(.375)(li.h) 

-  -1.56 

-1 


^3 
t 


-1 


(iii)  ?2  - 


^3 
t 


v/[.lU35l9  +  .1157U1  -  (2)(-.07870U)J  {h.h) 

-1 
>/  (.116668) (h.li) 

-1 

TTJFH 
-.7b- 

1 
1 

V^[.10l852  +  .ll57la  -  (2)(-. 037037)]  (li.U) 

1 


-  >/(. 291667) (U.U) 


1.133 


-  .88 

Duncan's  niultinle  range  test,  as  modified  by  Kramer,^  may  be 
employed  to  make  all  pairwise  comparisons  with  the  use  of  the  in- 
verse elements  and  the  standard  deviation  for  error.  If  the  values 


(yi  -  7j) 


/j^ 


cii  -  20^^ 


are  greater  than  erg  2^,  _     the  difference  is  significant;  where   Zp,no 
is  the  significant  studentized  range  value  in  Duncan's  tables  (either 
the  .05  or  .01  table),   p  is  the  number  of  means  in  the  range  chosen 

Z/Kramer,    Clyde  Young.     Extension  of  multiple  range  tests   to  group 
correlated  adjusted  means.     Biometrics  13:  13-18.     1957. 
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and  n2  is  the  number  of  degrees  of  freedom  for  error.  For  com- 
putational convenience  and  for  this  example  these  tests  are  made 
in. tabular  fom  below  at  the  .05  level  of  probability: 


y± 

m. 

h 

Product 
Differences 

1 

2 

Comparison 

v/cii 

+  CjJ-2CiJ 

^e  2p,n2 

1*2  vs  1*3 

1 

2.619 

2.62 

6.31 

r2  vs  r^ 

2 

2.309 

U.62 

6.63 

2*3  vs  fi 

1 

2.191 

2.19 

6.31 

All  differences,  of  course,  are  non-significant  for  this  example. 
These  tests  are  given  merely  to  illustrate  the  computational  pro- 
cedures involved. 

The  sums  of  squares  required  to  perform  tests  of  significance  for 
orthogonal  contrasts  among  a  set  of  constants  are  obtained  by  trans- 
forming the  segment  of  the  inverse  corresponding  to  the  set  of  con- 
stants. This  transformation  is  equivalent  to  the  computation  of  the 
inverse  elements  corresponding  to  the  variances  and  covariances  of 
the  selected  orthogonal  contrasts.  These  elements  are  most  easily 
computed  by  means  of  a  relatively  simrole  matrix  multiplication  pro- 
cediire.  If  T  is  the  transformed  segment  of  the  inverse,  K  the 
transformation  matrix  and  Z^   the  square  segment  of  the  variance- 
covariance  inverse  with  the  additional  column  and  row  added  it  can 
be  shown  that  T  =  KZ^K*. 

The  transformation  matrix  K  is  obtained  from  the  orthogonal  co- 
efficients which  define  the  desired  set  of  orthogonal  contrasts. 
Suppose  the  following  orthogonal  comparisons  are  desired  among  the 
three  rations  in  the  present  example: 


li  l2  l3 
2  -1  -1 
0    1-1 


Then  the  transformation  matrix  is: 


,[2  -1  .11 
a  [O   2  -2j 


It  should  be  noted  that  the  sum  of  the  coefficients  by  row  in  the 
tremsformation  matrix  sum  to  zero  by  row  with  the  sign  considered 
but  to  unity  when  disregarding  the  sign. 

The  transformation  of  the  complete  segment  of  inverse  elements 
for  the  ri  and  the  computation  of  the  sums  of  squares  for  each  con- 
trast are  given  below: 
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\ 


kza 


■il 


-1    -1 

2     -2 


.113^19  -.O6L1815  -.07870li 
-.061.815  .101852  -.037037 
-.0767011     -.037037       .115711 


T  "  KZj^K' 


1    r.b30557 
H  [.027778 

1    pt30557 
U  [.027778 

291671 


i^ 


l5      .083331 


-.19UMi5 
.277778 

-.19l]hh5 
.277778 

.O8333I4' 
1.166668 


-.236112 
-.305556 

-.236112' 
-.305556 


'^ 

0 

1 

-T 

? 

u 

-1 

-2 

An  easier  method  of  computing  T  is  by  subtracting  the  co-» 
efficients  in  the  last  column  of  the  transformation  matrix  K  from  the 
coefficients  in  the  other  columns  prior  to  the  matrix  multiplications. 
In  this  case,  the  last  column  and  row  for  Z  need  not  be  added,  i.e., 

T  »  K^ZK^  -  KZaK' 

The  constants  for  the  two  orthogonal  contrasts  are  obtained  from 
the  transformation  matrix  and  the  r^  constants,  as  follows: 

2  -1  -1 
0   2-2 


1 

IT 


Li' 

*  ^      * 

1 

s 

CI 

0 

C2J 

Therefore,  c^ 


'1  ~  ~  T  ^^^  ^2  '  ^  where  c^   is  the  estimate  of  the 

constant  for  the  contrast.  It  will  be  noted  that  these  constants  are 
one-half  of  the  difference  obtained  for  the  contrast  from  the  f j 

constants  directly  and  the  orthogonal  coefficients. 

The  sums  of  squares  for  the  two  orthogonal  single  degree  of  free- 
dom contrasts  are  now  obtained  from  B'Z-^B  in  the  usual  manner. 

S.Sqs.  for  Ration  1  vs.  Rations  2  and  3 
.  (-3A0^  (161 


1.291671 


6.9677 


S.Sqs.  for  Ration  2  vs.  Ration  3 
„  (1/2)"  (16) 


1.166668 


3.1286 


The  sums  of  squares  for  this  set  of  orthogonal  contrasts  do  not 
add  up  to  the  sum  of  squares  among  the  rations  shown  in  the  an- 
alysis of  variance  table  on  page  9,  i.e.,  6.9677  +  3.1'286  t   11.1112. 
This  is  true  because  of  unequal  numbers  among  rations.  In  effect, 
the  partitioning  of  the  two  degrees  of  freedom  among  the  rations  into 
the  chosen  set  of  orthogonal  comparisons  has  created  a  two-way 
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classification  situation  with  unequal  subclass  numbers.  If  the  same 
means  are  compared  with  weighted  coefficients  that  are  directly  pro- 
portional to  the  unequal  frequencies  the  sums  of  squares  for  the 
contrasts  will  add  to  the  sum  of  squares  for  rations, 

7)  A  short-cut  method  of  computing  the  inverse  of  the  variance-covariance 
matrix  and  the  constant  estimates: 

When  constants  are  fitted  for  all  degrees  of  freedom  among  a  set  of 
^.   classes  or  subclasses  the  inverse  of  the  variance-covariance  matrix  and 
the  constant  estimates  can  often  be  obtained  most  easily  with  the  aid  of 
a  transformation  matrix.  If  D  is  the  diagonal  least-squares  coefficient 
matrix  for  the  classes  or  subclasses  and  K  is  the  transformation  matrix, 
then  the  inverse  of  the  variance-covariance  matrix  (C)  is  computed  from 
KD~^K*,  The  constant  estimates  are  obtained  from  KS  »  B,  where  S  is  a 
column  vector  of  the  class  or  subclass  means  and  B  is  a  column  vector  of 
the  constants. 

The  diagonal  least  squares  coefficient  matrix  (D)  for  the  present 
one-way  classification  example  is 


D 


r^ 

0 

0 

0 

8 

0 

0 

0 

6 

These  are  the  left  hand  members  for  the  least-squares  equations  for  the 
H  +  r-j^.  The  transformation  matrix  is 


K.i 


111 

2  -1  -1 

-1   2  -1 


since  this  is  the  functional  relationship  of  ^,  +  Tj^,  the  ration  means, 
and  the  two  kinds  of  constants  \i   and  r^,   i.e., 

AAA 

81  +  S2  *  83 


n  - 


^^  ^  ^ 

St  +  So  ''"So 


2  *•         1  •*    1  * 

-3^-3  ^2  -3  ^2, 

T/)   ■  -  i  ST  ♦  —  So  -  ±  87  . 

^     3  -L   3  ^   3  :j 

where  the  b^  are  the  subclass  (ration)  means. 

Since  D  is  a  diagonal  matrix,  its  inverse  consists  only  of  re- 
ciprocals of  the  diagonal  elements.  Hence,  the  inverse  of  the  variance- 
covariance  matrix  and  the  constant  estimates  can  be  computed  individually 
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by  setting  up  the  following  table: 


\i- 

ri 

^2 

Ration 
Means 

D-l 

Co 

nstants 

«1 

1 
3 

2 
3 

1 
3 

h 

.250000 

5 

B2 

1 
3 

1 
~3 

2 

3 

6 

.125000 

-1 

^3 

1 
3 

_1 
"3 

1 
'3 

5 

.166667 

1 

The  constants  in  the  last  column  of  the  above  table  are  computed  by  multi- 
plying each  of  the  columns  in  the  transpose  of  the  transformation  matrix  on  the 
left  in  turn  with  the  column  of  ration  means.  For  example, 

JI  -  i  (U  +  6  +  5)  -  5 

ri- i  [(2)a)  -  6  -  5]  -  -1 
?2  -i  [-ii  +  (2)(6)  -  5]  -  1  . 

Since  r^  +  r2  +  r^  »  0,  then  ro  can  also  be  easily  computed. 

The  inverse  elements  of  the  variance-covariance  matrix  are  obtained  by 
multiplying  each  column  of  the  transpose  of  the  transformation  matrix:  by  it- 
self and  with  each  of  the  other  colmnns  in  turn  and  then  multiplying  this  pro- 
duct by  the  D-1  column.  For  example, 

;  C^  -  i  (.250000  +  .125000  +  .166667) 

=  .060185, 

C^^   "  I  [(2) (.250000)  -  ,125000  -  .166667] 

-  .O23IU8, 
etc. 

The  inverse,  obtained  in  this  manner,  is  presented  below: 

.060185   .O231I18  -.018519 


C  - 


.0231h8   .113519  -.06L815 
-.018519  -.06lj8l5   .101852 


It  will  be  noted  that  these  elements  agree  with  the  inverse  given  in  (l). 
The  inverse  computed  in  this  manner  can  be  obtained  as  accurately  as  desired 
since  the  only  source  of  error  is  the  number  of  significant  digits  carried  in 
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the  reciprocals  of  the  diagonal  elements, 
and  the  class  or  subclass  means. 


The  same  is  true  for  the  constants 


Summary  of  the  One-way  Classification  Analysis 

Although  in  practice  one  would  not  use  the  least-squares  procedures 
when  the  data  are  classified  in  only  one  way,  it  is  evident  that  the  same  re- 
sults can  be  obtained  with  these  procedures  as  with  commonly  used  direct  cal- 
culation procedures.  In  adc^ition,  the  principles  and  comnDutational  pro- 
cedures of  least-squares  analysis  can  be  more  easily  introduced  by  way  of  the 
one-way  classification  analysis.  Extension  of  these  orocedures  from  the  one- 
way classification  to  more  complex  designs  is  straight-forward  for  one  who  is 
familiar  with  matrix  arithmetic.  The  extensions  to  the  more  complex  designs 
are  given  in  later  sections.  Also  some  additional  least-squares  comoutational 
procedures  are  introduced  in  the  remaining  sections  which  are  not  apnlicable 
to  the  one-way  classification.  These  are  the  absorption  of  a  set  of  equa- 
tions and  the  comDutation  of  coefficients  for  variance  components  in  the  ex- 
pectation of  adjusted  sums  of  squares  for  effects  that  have  been  absorbed. 
Procedures  for  obtaining  coefficients  for  variance  components  in  least- 
squares  sums  of  squares  or  cross-products  are  given  in  the  section  for  the 
two-way  classification. 


0NE-V7AY  CLASSIFICATION  OTTH  REGRESSION  OR  OOVARIANCE 


Covariance  analvsis  of  data  from  a  one-way  classification  can  be  accom- 
plished by  standard  methods  even  though  unequal  numbers  exist  from  class-to- 
class.  However,  identical  results  can  be  obtained  with  least-squares  pro- 
cedures, which  directly  involve  matrix  arithmetic.  These  procedures  are 
applicable  to  a  wide  range  of  problems.  If  mean  separation  procedures  are  to 
be  used  on  the  least-squares  class  means  or  if  a  set  of  orthogonal  comparisons 
is  desired,  it  is  best  to  obtain  the  inverse  of  the  variance-covariance  matrix, 
An  alternative  procedure  for  6btaining  this  inverse,  which  is  often  useful,  is 
presented  in  this  section  for  the  one-wav  classification,  when  a  continuous3/ 
independent  variable  must  also  be  considered.  The  method  is  applicable  also 
to  more  complex  Droblems, 


Model  V     ■ 

The  mathematical  model  for  the  one-way  classification  with  a  regression 
may  be  written  in  two  ways.  If  the  continuous  independent  variable  (X)  is 
to  be  taken  as  a  deviation  from  the  mean  (of  X),  then  the  model  is  as  follows: 


3/  Actually,  it  is  not  necessanr  for  the  covariate  to  be  a  continuous  var- 
iable. However,  except  for  the  case  of  only  two  classes  such  as  sex,  it 
is  usually  more  satisfactorv  to  fit  individual  constants  for  discrete 
classifications  rather  than  regressions.   Hence,  the  covariate  will  always 
be  referred  to  here  as  a  continuous  variate  rather  than  discrete. 
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i  -  1,  2, ,  p 

j  -  1,  2, ,  Hi 

where:       y^j  ■  the  j"^"  observation  in  the  i'th  A  class, 

\i   »  the  overall  mean  for  the  j^a   when  equal  frequencies 
exist  in  each  of  the  A  classes, 
a^  "  the  effect  of  the  i'th  A  class, 

b  «  partial  regression  of  the  dependent  variable  (y)  on 
the  independent  continuous  variable  (X)  holding  the 
discrete  variable  (the  a^)  constant.  The  discrete 
variable  or  variables  are  held  constant  in  a  co- 
variance  analysis  by  estimating  b  on  an  error  line 
basis.  In  the  one-way  classification  this  means  that 
b  is  estimated  on  an  intra-group  basis, 

I^^  »  the  continuous  independent  variate  for  the  corres- 
ponding y^^  observation.  The  X^a   are  regarded  as 

fixed  and  measured  v/ithout  error, 

X  ■  the  arithmetic  mean  of  the  X^j, 

6^4  «  the  random  errors.  These  are  assumed  to  be  in- 
dependent and  if  tests  of  significance  are  to  be 
made  they  also  must  be  assumed  to  be  normally 
distributed. 

In  practice,  it  is  more  convenient  to  work  with  the  values  of  Xj^^  rather 
than  the  plus  and  minus  deviations  of  the  X^j  from  the  mean,  x.  When  this  is 
done  the  descriptive  working  model  is  as  follows: 

yij  -  a  +  ai  +  bXij  +  .eij  , 

where  the  definition  of  all  common  terms  remains  the  same.  The  new  symbol,  ct, 
is  the  population  mean  when  X  is  equal  to  zero.  Using  this  model  it  is 
therefore  necessary  to  obtain  the  estimate  of  jx,  from 


a  + 


Sx 


Least-Squares  Equations 

The  least-squares  equations  for  the  one-way  classification  with  one  con- 
tinuous independent  variable  are  shown  below  in  tabular  form: 


^&- 


o.tt-nnf  AA  Oniv     nenroduotton  Of  This  Horn  Mnv  Constitute  Coovriaiit  ln(rlna«m«nt 


a 

h 

b 

RHM 

a: 

n. 

^ 

X. 

T. 

^i- 

"i 

o"i° 

Xi 

Yi 

b: 

X. 

Xi 

zzx\i 

^fiJ^iJ 

If  the  X^^  are  expressed  as  deviations  from  x   then  the  equation  for  b 
becomes 


ZZ(Xij  .  x)  ai  +  Z2(Xij  -  x)^S  -  |^(^ij  -  ^)yij 
and  \i  is  estimated  directly  rather  than  a. 

Imposing  Restrictions 

The  restrictions  imposed  in  order  to  obtain  a  \mique  solution  to  these 
equations  are  the  same  as  for  the  one-way  classification  without  the  re- 
gression being  included.  Thus,  no  restriction  is  necessary  for  the  regression 
equation.  In  the  subtraction  procedure  when  the  restriction  of  Z  aj_  =  0  is 

imposed  the  column  and  row  coefficients  for  the  b  equation  are  treated  in  the 
same  manner  as  for  the  RHM  values. 


Inversion  of  the  Reduced  Ifetria:  and  Solution  of  the  Equations 

When  constants  for  all  the  degrees  of  freedom  among  a  set  of  classes  or 
subclasses  are  being  fitted,  in  addition  to  the  regression  (or  regressions), 
an  alternative  procedure  is  available  for  obtaining  the  inverse  of  the  var- 
iance-covariance  matrix.  In  this  case  the  regression(s)  are  being  fitted  on 
a  "within"  class  or  subclass  basis  and  can  be  calculated  from  the  equation(s) 
obtained  on  a  "within"  basis.  For  example,  in  the  simple  case  being  con- 
sidered here  the  least-squares  estimate  of  b  is  given ty 

2Z  Xi.yi.  -  Z  bJk 
^       ij  ^^  ^-^       i  Hi 
b  -  ^ ^5 

Z2  X/a  -  Z  ^ 

ij   "^   i  ^ 

The  diagonal  inverse  element  of  the  variance-covariance  matrix  corresponding 
to  the  b  is 

1 

1 ^ 

The  problem  is  to  adjust  the  inverse  elements  of  the  diagonal  coefficient 
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matrix  of  the  a  +  a±   and  then  transform  this  adjusted  inverse  matrix  vd.th  the 

appropriate  transformation  matrix.  In  order  to  obtain  the  inverse  elements  of 
the  a  +  a^   section  it  is  first  necessarv  to  obtain  the  inverse  elements  of  the 

variance-covariance  matrix  for  the  b-column  or-row.   These  elements  are  re- 
ferred to  as  the  G  section  of  the  variance-covariance  inverse  and  are  obtained 
as  follows:  _ 


Gib  .  G^i 


h 


ni|Z2 


'ii 


i  Hi 


The  diagonal  inverse  elements  for  the  a  +  aj^  section.  A,  are  then  computed 
from 

Aii  -  ~-(l  -  X^Gbi) 
and  the  off -diagonal  inverse  elements  for  the  a  +  Ej^  section  are  given  hy 

Hi  ^ 


or 


Aid 


-  1-(2.G^^) 
nj  J 


The  inverse  of  the  variance-covariance  matrix  for  the  a  and  a^^  separately 
may  then  be  obtained  ft-om  the  matrix  multiplication  KAK',  where  K  is  the 
functional  relationship  matrix  between  the  classes  (the  iS  +  a^^)  and  the  in- 
dividual constants  (a  and  a^),  and  A  is  the  inverse  matrix  for  the  a  +  a^.  In 

the  one-way  classification,  as  was  shown  in  the  numerical  example  in  the  pre- 
vious section  on  the  one-way  classification. 


■   1  11 

p-1  .1  -1 

P  P   P 

-  1  P-1  ,1 

P  P   P 


•  •  •  "- 


•  •  • 

•  •  • 


1 
.1 

P 

1 

P 


1    _1   1  _  p-1  _1 
p   "p   p  "  P   P 


The  estimates  of  the  constants,  a  and  the  aj^,  may  then  be  obtained  in 
either  of  two  ways.  The  inverse  matrix  for  the  a  +  a^  and  b  may  be  multi- 
plied by  the  appropriate  right  hand  irombers,  i.e.  the  Yj_^  and  ZZ  Xj^jy^j,  one 

row  (or  column)  at  a  time,  to  obtain  the  a  +  a^.  Then  by  imposing  the  re- 
striction that  Z  ai  ■  0,  the  estimate  of  a  is  obtained  from 
1  2(a  +  L) 

-   i 

a  ■ 
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1 


i 
p 
i 
I 

I 

I 

I 
I 

I 

p 


and  the  aj_  are  then  computed. 

The  second  method  of  computing  a   and  the  a^  is  from  the  complete  variance- 
covariance  inverse  matrix  for  a,  a^^,  and  b  and  also  the  right  hand  members  for 
the  reduced  least  squares  matrix.  The  inverse  elements  obtained  above 
(the  G's)  for  the  b-row  and-column  must  be  transfcrm.ed  so  that  thev  will  cor- 
respond to  the  variance-covariance  rretrix  for  a,  a^,  and  b  rather  than  to 
a  +  s.±y   and  b.  These  are  easilv  obtained  from  KG,  where  G  is  the  column  vec- 
tor of  the  Gib. 

Computation  of  Sums  of  Squares  and  Standard  Errors 

The  sum  of  squares  for  error  is  equal  to 

2Z  y?.  -  R(a,  a.,  b)  . 
ij  ^ 

The  reduction  due  to  fitting  all  constants  H(a,  a^,  b)  is  equal  to  ^([i,  a^,  b), 
and  is  obtained  from 

a  Y.  +  2  aiYi  +  b  ZZ  ^±^y±^ 

or  more  easily  from        p-1 

S  Y.   -K     Z     ^i   (Yi  -  Y)    *S     ZZXi^yi^   . 
1=1  ij        '^     "^ 

If  the  Xj^j  have  been  expressed  as  deviations   from  x,    then  p.  is  obtained 
directly  rather  than  a,   and 

R{\iy  a^,  b)  =  R(a,  'ai,  b)   »  (1  Y.   +  Z  aiYi  +  B  ZZ  (Xij  -  x)  yij 

p-1 
-  il  Y.   +     Z     a  (Ifi  -  Yp) 

+  b  ZZ  (Xij  -i)yi;j 

In  the  one-way  classification  with  covariance  the  error  sum.  of  squares 
can  be  obtained  bo re  directly,  of  course,  from 


ZZ  y?A   -  2  —  -  b^(ZZ  Xf .  -  Z  ^) 
ij  ^^  ±H  ij  ^J   i  "i 


since  R(|x,  a^,  b)  is  equal  to  the  last  two  terms  in  this  formrila. 

The  sum  of  squares  among  the  A  classes,  adjusted  for  variations  in  the 
mean  of  X,  may  be  obtained  from  B'Z-^B  as  explained  in  the  previous  section. 
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In  the  one-way  classification  with  covariance  this  svnn  of  squares  is  obtained 
by  direct  covariance  procedures.  The  more  general  methods  are  presented  here 
so  that  extension  to  analyses  where  standard  methods  of  analysis  are  un- 
available can  be  made  more  easily. 

The  sum  of  squares  for  testing  the  significance  of  the  regression  co- 
efficient (from  zero)  is  obtained  in  the  same  manner,  i.e.,  B'Z-l-B,  but  in 
this  case  the  seemingly  complicated  matrix  multiplication  reduces  to 

ij   ^   i  Hi 

Standard  errors  for  the  least-squares  means  for  the  A  classes,  (1  +  a^, 
may  be  obtained  from  the  variance-covar lance  inverse  matrix  for  CZ,  a^,  and  b 
as  follows: 

where  a^  is  the  mean  square  for  error  and  the  superscripts  on  the  C's 
Identify  the  inverse  elements. 

The  inverse  elements  that  would  have  been  obtained  for  the  |a-row 
(or-column)  if  the  X's  had  been  taken  as  deviations  from  5c  can  be  computed, 
if  desired,  from 

If  these  are  available,  the  standard  errors  of  the  0  +  a  can  easily  be  com- 


puted from 


Vst  '  ^  ^^'''' ""  ^^^^^ ""  ^^"^^^  ^ 


Computation  of  standard  errors  of  differences  among  the  least-squares 
means  need  not  involve  the  inverse  elements  in  the  ^  or  a  row  (or  column) 
since  

(p+ai)  -  (p+flj)    ai-aj   ^  ^^  j     ^ 

These  standard  errors  of  differences  among  the  a.  are  required  if  mean  sep- 
eration  procedures  are  to  be  employed.  The  inverse  elements  for  the  a^ 
section  can  be  transformed,  as  explained  in  the  previous  section,  if 
orthogonal  comparisons  among  the  a  are  desired. 

Numerical  Example 

The  same  set  of  data  used  in  the  previous  section  is  repeated  below  but 
with  the  addition  of  a  covarlate  as  follows: 
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Ratic 

Dn  Nc 

>• 

1 

2 

3 

Pig  No. 

W 

y 

W 

7 

W 

y 

1 

5 

3 

h 

5 

8 

7 

2 

9 

5 

7 

6 

7 

6 

3 

11 

6 

0 

2 

3 

li 

h 

3 

2 

8 

7 

2 

3 

5 

10 

8 

8 

6 

6 

2 

3 

2 

U 

7 

12 

9 

8 

5 

8 

Totals 

28 

16 

he 

U8 

30 

30 

Means 

7 

h 

6 

6 

5 

5 

1)  Mathematical  model: 

yij  -  a  +  Pi  +  b  Wij  +  eij 

i  -  1,  2,  3 

j  =  1,  2,  ...,  8 
where : 

7±^  "   the  gain  of  the  J^h  barrow  on  the  i^h  ration, 

a  «  the  overall  mean  when  Wj^j  =0, 
Pi  "  the  effect  of  the  i^h  ration, 

b  »  partial  regression  of  gain  on  initial  weight, 

WjL-<  ■  the  initial  weight  for  the  jth  pig  on  the  ith  ration, 

eij  ■  random  errors. 

It  should  be  pointed  out  that  the  assunption  of  random  errors  requires 
that  the  regression  be  linear  and  homogeneous  from  one  ration  to  another. 

2)  Least-squares  equations: 


ri     f2     P^       S       RHM 


a: 

18 

h 

8 

6 

106 

9U 

n- 

U 

h 

0 

0 

28 

16 

rg: 

8 

0 

8 

0 

U8 

18 

^3' 

6 

0 

0 

6 

30 

30 

b: 

106 

28 

18 

30 

832 

656 
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3)  Imposing  the  restriction  that  2  r^^  »  0: 

The  reduced  least-squares  equations  are  given  below  in  tabular  form: 


ri  r2   b    RHM 


a:  18  -2  2  106  9h 

ri:  -2  10  6  -2  -lli 

r2:  2   6  U  18  18 

b:  106-2  18  832  6^ 


h)     Inverse  of  the  reduced  least-squares  equations  and  constant  estimates: 

The  inverse  of  the  reduced  set  of  least-squares  equations  obtained  with 
an  ordinary  method  of  inversion  is  given  below: 

Matrix  Inverse 


^2 


a:         .21x2003       ,053li5l     -.018^9     -.030303 

ri:         .053h5l       .11x8569     -.061i8l5     -.005o5l  .....   (2) 
r2:       -.018519    -.06li8l5       .101852       .000000 
b:       -.030303     -.005051       .000000       .005051 

The  constant  estimates  may  be  obtained  by  multiplying  each  column 
(or  row)  of  the  inverse  by  the  reduced  RHM's.     The  constants  obtained  are: 

a  -     1.7879  r2  -  1.0000 

^L  »  -1.^351  ^  -     .535U  } 

^  -  I 

The  overall  mean  p.  is  then  obtained  from  f 

p,  "  a  +  S  w 

-  1.7879  +  (.5351)  (5. 8889)^  -     \ 

-  U.9I1O8 

The  estimate  of  r^  is  ■  I 

T^  -  -dl  +  r2) 

-  -(-1.535lx  +  1.000) 

-  .5351   . 
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Since, 

the  diagonal  inverse  element  for  ji  +  r^can  be  computed  from 

Cii+ri  -  c^a  ^.  ^2  ^bb  +  c^iri  ^  ^w  (f  ^  +  C^^i  +  2w  C^^^  • 

Another  method  of  computing  this  inverse  diagonal  element,  which  is 
necessary  if  the  standard  error  of  p,  ■•■  r^^  is  desired,  is  possible  from 

However,  the  inverse  elements  involving  \i  are  not  directly  available  from 
the  inverse  matrix  unless  the  \i±A   have  been  expressed  as  deviations  from 
the  overall  mean.  These  inverse  elements  for  the  p.-row  and-column  can 
be  computed  from  the  available  inverse  elements,  as  follows: 

CWt  -  Oaa  +  2i5  C^h  +  ijs  ^bb 

c^i  .  cOTi  ^  -  c^n  ^ 

In  the  present  problem, 

C^  «  .2U2003  +  (2)(5.888889)  (-.030303)  +  (5.888889)^  (.005o5l) 

-  .060265  , 

C**^  -  .053151  +  (5.888889)  (-.005051) 

-  .023706  , 

CP*2  „  ..018519  +  (5.888889)  (.000000) 

-  -.018519  . 

(a)  Alternative  procedure  for  computing  the  inverse  matrix.  As  indicated 
above,  a  method  of  obtaining  the  inverse  of  the  variance-covariance 
matrix  more  easily  in  some  cases  is  available  when  constants  for  all 
degrees  of  freedom  among  the  classes  or  subclasses  are  being  fitted. 
This  method  was  described  above  for  the  one-way  classification  when 
a  regression  coefficient  is  being  fitted  and  will  now  be  illustrated 
with  the  computational  example. 

The  inverse  element  for  the  b,  S  position  of  the  inverse  matrix 
is 

C^t)  . 1 

2ZW?.  -  Z  ]!!i 
ij  ^J   i  n^ 

1 


-25- 


'-^■^■'''!^^'"^^'7^Lf^^:^^,.U^.    Oi   Thin  M«m  M.v  n«n«..»,.».  .^«..,..Sv  ..«,. » 


■ 

I 

p 

I 

I 
I 
I 
I 

I 

I 
I 
I 
I 
I 
I 


-  .0050 5051 

which  checks  with  the  value  for  this  element  obtained  in  the  simul- 
taneous inversion  of  the  complete  matrix.  The  inverse  elements  for 
the  b-colunm  and-row  for  the  a  +  r^  are  obtained  as  follows: 

.lb  „  G^l  , "^ 


"^  (ij  ""^  i  ^_ 

-  -(.00505051)  (^) 

-  -.03535357  , 

G^^  =  G^2  „  .(.00505051)  <^) 

=  -.03030306, 

G^b  „  Qh3   .  (.00505051)  (^) 

-  -.02525255  . 

The  adjusted  diagonal  inverse  elements  for  a  +  r^  are  then  computed 
as  follows: 

All  -2.  (1  -WtG^I) 
°1      ^ 

.  1  [1  -  (28)(-.03535357)] 

-  .U97li7li99  , 

a22  -|  [1  -  (18) (-.03030306)] 

«  .30681836     , 
a33  -|  [1  -  (30) (-.02525255)) 

-  .292929I12. 

The  adjusted  off -diagonal  inverse  elements  for  the  a  +  r^ 
(which  were  zero)  are  then  computed  as  follows: 

a12  -  a21  -  -  i  Wt  G^2 

-  -  1  (28) (-.03030306) 

h 

-  .21212112  , 
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a13  »  a31«.^  (28)(-. 0252525^) 

-  .17676785  , 
a23  =  a32«.,1  (1j8)  (-.02525255) 

=  .i5l5i530. 

Transfornation  of  the  a  +  r^  segment  of  the  inverse  matrix  is  now 
accomplished  as  follows: 


KAK«    -  -r 


1 
3 


111 

2  -1  -1 

-1  2     -1 


.)j971j7Ii99  . 21212112  .17676785 
.21212112  .30681836  .15151530 
.17676785     .15151530     .292929L2 


K' 


.88636U26  .67015508  .62121257 

.60606071  -.03109082  -.09090902 

-.25000000  .25000000  -.16666667 

\2L2003  55  .05315121  -.01851852' 

.053^5121  .11856903  -.06[i8lh8l 

-.01851852  -.06[j81Ii81  .10l85l85 


12-1 
1-12 
1  -1  -1 


These  inverse  elements  check,  within  rounding  errors,  with  those  obtained 
directly  (see  matrix  inverse  (2))  from  matrix  inversion  procedures. 
Actually,  less  rounding  errors  will  accumulate  in  the  computation  of  the 
inverse  elements  with  this  indirect  method  than  when  the  inversion  is 
completed  on  the  least-squares  equations  by  an  elimination  method. 

The  transformed  inverse  elements  for  the  b-column  (and  row)  are  ob- 
tained from 


KG  «  i 


111  -.03535357 

2  -1  -1     -.03030306 

-1  2  -ij    I-. 02525255 

-.03030306'" 
-.QO50505I 
.00000000. 

which  also  check  with  the  values  for  these  elements  that  were  obtained 
directly  (see  matrix  inverse  (2)). 

(5)  The  analysis  of  variance: 

(a)  Sum  of  squares  for  error. 

Error  S.  Sqs,  -  T.Z  yf*   -  R(a,r^,b) 
ij  "^ 


-  568  -(1.7879)(9M-(-1.535h)(-ll) 
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-(1.0000)(l8)-(.5351i)(656) 
-  568  -  558.7806  -  9.219U. 
(b)     Sum  of  squares  for  rations   (R) . 
Ration  S.Sqs.   »  B'Z""^  B 

'  .118569     -.O6I4815' 
,101852 


Pt   cr^tn,    1  nnafl  r»ll8569    -.06I4815I  Pl.535hl 

[-1.5351     1.000^   [..061815       .101852]  [l.OOOoJ 

fi    c-ocn     1   nnnr^J     [^'317661,       5.9291j3i|  pi.535ll 

[-1.535h     1.000C[     [^^^,29131     13.591bh7j  [l.OOOoJ 

[-8.376910    I.U87399]    [l:U^] 


-  17.3U93 

(c)     Sum  of  squares  due  to  regression. 

Regression  S.Sqs.   -  B«Z       B 

,  (.535L)^  ,  (.535U)^  (198) 
.005051 

-  56.7573 

The  analysis  of  variance  is  as  follows: 

M.S.         F 


Source  of  Variation     d.f.       S.Sqs.  _ 

Rations  2       17.3U93       8.67L16     13.17^ 

Regression 

Error 


1   56.7573  56.7573  86.19-»^ 
lU   9.219U   .6585 


Differences  due  to  rations  are  now  highly  significant;  whereas 
in  the  first  section,  using  the  same  data  but  ignoring  the 
initial  weight,  ration  differences  were  not  significant.  The 
error  term  has  been  reduced  from  h.UOOO  to  0.6585. 

(6)  Standard  errors  and  individual  comparisons: 

(a)  Standard  errors  of  the  least  squares  means,  Ji  and  p.  +  r^. 


sfi-  >/(. 060265) (.6585) 
-  .199 


8fl+?i  -  y/  £.060265  +  .lh8569  +  (2)(. 023706)]    (.6585) 
-  Ml  ^ 
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s(i+r2  "  V[.060265  +  .101852  +  (2)(-.Ol85l9)J  (.6585) 

-  .287  , 

sil+r3  -  >/fo60265  +  .120791  +  (2) (-.005187))  (.6585) 

-  .335  . 

(b)     Individual  comparisons  among   the  v^ 
(i)     "t"  test 

ri-r2  -  -2.5351      t  -  -  ^-^^  -  5.068^, 

?l-f^3  «  -2.0708      t  -  -  li^Zg^  -  3.86l^H^, 
rp-ro  -       .1616       t  -  '^  -  1.051  • 
(ii)     Mean  separation  -with  Duncan's  fiiltiple  Range  Test  (.05). 


Product 
Comparison     J±~J^     y/  C^^+C^J-^G'^J     Differences     a^  z 


e   ^Pin2 


r2  vs  r3         .I61i6  2.291 

r2  vs  ri       2.5351  2.1hO 

r3  vs  r^       2.0708  2.596 


1.07  2.U6 
5.lt3^  2.58 
5.28*     2.)i6 


(ill)  Orthogonal  contrasts. 

Suppose  that  it  is  desired  to  test  the  significance  of 
linear  and  quadratic  effects  among  the  r^,  Ip  this  case  the 
transformation  matrix  for  the  r^  segment  of  the  inverse  matrix 
(including  the  third  row  and  coluren;  is. 


and 


1  [2  0  -2] 
"r  Ll  '2.->l\ 


KZK» 


1   h     0-2 
C  [1  -2  1 

1  r.li6Ii6h6 


.11x8569  -.06li8l5  -.08375U 
-.06U815   .101852  -.037037 

.120791 


-.083751    -.037037 

-.055556    -.109090" 
-.305556     .mill 
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S.Sqs.  Linear 


S.Sqs.   Quadratic 


1     ri»7U7li72     .166668] 
"  16  I    .166668     .916668J 

r.  109217     .OlOhl?"] 
[.OlOhl?     ,007292]   * 

ri(2)(.1.535h)  -i(2)(. 535h)) 

rlh u J 

.109217 

-  li^mlkl!  ,  9.816^  . 
.109217 

fc(-1.035l)  -i(2)(l.OOOO)  *^(.^35li)| 

.007292 

„  (--7000)=^  -  9.818**  . 
.007292 


Again,  it  will  be  noted  that  the  total  of  the  sums  of  squares  for  the 
orthogonal  contrasts  is  not  equal  to  the  smn.  of  squares  for  rations. 


Summary  of  the  One-way  Classification  Analysis  with  Covariance 

The  least-squares  procedures  (method  of  fitting  constants)  for  analyzing 
data  with  unequal  numbers  have  been  applied  to  the  analysis  of  data  classi- 
fied in  only  one  way  but  also  considering  a  continuous  covariate.  Detailed 
generalized  procedures  have  been  presented  for  the  computation  of  the 
variance-covariance  inverse  matrix,  the  constant  estimates,  sums  of  squares 
for  the  analysis  of  variance,  standard  errors  of  least-squares  means  and  tests 
of  significance  for  individual  comparisons  or  for  orthogonal  contrasts  among 
the  class  effects.  These  procedures  are  first  presented  in  algebraic  form 
for  this  design  and  then  numerically  using  a  simple  example.  The  basic  pro- 
cedures described  here  are  directly  applicable  to  more  complex  designs  which 
will  be  discussed  in  later  sections. 


TWO-WAY  CLASSIFI CATION  WITHOUT  INTERACTION 

When  disproportionaie  numbers  exist  among  subclasses  in  the  two-way 
classification  the  method  of  fitting  constants  by  least-squares  should 
usually  be  employed.  For  the  special  case  of  only  two  classes  of  A  or  two 
classes  of  B,  short-cut  methods  for  obtaining  sums  of  squares  for  the  analysis 
of  variance  have  been  developed  and  are  given  in  several  statistical  books. 
The  least-squares  method  of  fitting  constants  for  the  general  case  of  the  two- 
way  classification  without  interaction  will  be  presented  here. 


^^ 


■  J 
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Model 

The  model  for  the  two-way  classification  when  the  interaction  of  A  and  B 
is  assumed  non-existent  is 

yijk  -  ^1  +  ai  +  bj  +  ei^k (3) 

i  =  1,  2,  ...,  p 

j  =  1,  2,  ...,  q 

where : 

y.  .,  "   the  k"^^  observation  in  the  j"tb  B  class  and  the  1"^^  A  class, 
\i   »  overall  mean  when  equal  subclass  numbers  exist, 
a^  =  effect  of  the  ±^^   A  class, 
bj  »  effect  of  the  ^^^   B  class, 
®ijk  "  random  errors,  assumed  to  be  NID(o,CTg'), 

This  model  reduces  to  that  for  the  randomized  block  design  only  when  k=l  for 
each  combination  of  i  and  j. 

The  aj_  and  the  bj  may  either  be  regarded  as  fixed  or  random  effects.  If 

both  classes  of  effects  are  fixed,  the  model  is  referred  to  as  the  "fixed" 
model,  the  linear  hypothesis  model,  or  Model  I  of  Eisenhart.  If  both  the  a^ 

and  the  b^  are  randomly  drawn  from  some  infinite  population,  such  as  cows, 
dams,  sires,  etc.,  the  model  is  referred  to  as  the  "random"  model  or  Model  II 
of  Eisenhart.  If  the  effects  for  one  of  the  classes  are  fixed  and  the  other 
randoB,  the  model  is  referred  to  as  the  "mixed"  model. 

If  a  class  of  effects  is  to  be  regarded  as  fixed  (such  as  the  effects  of 
a  set  of  selected  treatments),  then  the  investigator  is  interested  in  least- 
squares  means,  standard  errors,  tests  of  significance,  mean  separation  pro- 
cedures and  possibly  orthogonal  comparisons.  On  the  other  hand,  if  a  class  of 
effects  is  regarded  as  random,  the  investigator  is  primarily  interested  in 
estimating  the  variance  component  from  that  source.  Hence,  with  random 
effects  it  is  desirable  to  have  a  large  number  of  degrees  of  freedom  among  the 
effects  in  order  to  estimate  the  variance  (or  possibly  covariance)  component 
more  accurately.  In  both  cases,  however,  it  is  desirable  to  obtain  the  least- 
squares  sums  of  squares.  With  fixed  effects,  it  is  essential  to  obtain  the 
inverse  of  the  variance-covariance  matrix  in  order  to  make  individual  com- 
parisons. In  many  cases,  it  is  also  desirable  to  compute  the  inverse  seg- 
ment (s)  of  the  variance-covariance  matrix  for  the  random  effects.  The  general 
least-squares  procedure  for  both  kinds  of  effects  will  therefore  be  presented 
and  followed  by  an  absorption  procedure  which  reduces  the  number  of  com- 
putations when  the  random  effects  are  associated  with  a  large  number  of 

-31- 


»w^^p^>ffiP|iiiiBli)ipi.ii].gyjyqpjB!JjP!iBP^Bl^^  Hijlf^ 


■ftniif.o  riiii.ii  liiifiriri-inifir  rMtr<triiitiir(r«mrrtiiifrftri-iTfm[ri»Wg^^  rmUmiMifCgi^  fj^n  -fiPihMfiit>«Mrif>ir .  .ii.ri..-iiri>.i.>-.,^ 

^     »» -.    r»»lw       KanrnHiir>tinn    Of   Thl<»    Hftm    HSnu    nnnattttitm.    r^nnurlr^Sr    Infrln^^m^nt 


H 


degrees  of  freedom. 


Least-Squares  Equations 


The  least-squares  equations  for  the  two-way  classification  are  given  in 
tabular  form  below: 


Bj           RHM 

IA» 

n.. 

ni. 

n.j         Y.. 

H' 

«i. 

o"i.° 

"ij         ^i* 

^y 

^'i 

^ij 

0 

n-j"^     ^-J 

It  should  be  noted  that  the  sum  of  the  coefficients  for  the  a^   in  the  |ji  equa- 
tion equals  the  sum  of  the  coefficients  for  the  Sj  and  the  coefficient  for  jl. 
In  addition,  the  sum  of  the  coefficients  for  the  bj  in  an  a^^  equation  equals 
the  coefficient  for  the  a^^  and  the  total  of  the  RHM's  for  the  a^  equations  and 
the  b^  equations  equals  the  grand  total  of  the  y^j  Y...  In  order  to  solve 

these  equations  or  to  invert  the  coefficient  matrix  it  is  necessary  to  impose 
restrictions  on  the  a^^  and  on  the  bj. 

Imposing  Restrictions 

A  common  restriction  on  these  equations  is  to  set  ap  ■  bq  =  0  and  delete 
the  equations  and  columns  for  flp  and  bq.  The  inverse  of  the  resulting  re- 
duced coefficient-matrix  must  be  transformed  if  the  standard  errors  of  the 
jl  +  ai  or  the  (t  +  S-j  are  desired,  or  if  the  coefficients  of  variance  con^- 

ponents  in  the  expectation  of  mean  squares  are  to  be  obtained  by  means  of  a 
short-cut  procedure  which  will  be  described.  Hence,  it  is  generally  pre- 
ferred to  impose  the  restrictions  that  Za:j  -»  zBj  -0  and  to  make  the 

i     3  ^ 
necessary  subtractions  in  the  least-squares  equations  before  inversion  of  the 

matrix.  It  is  much  easier  to  complete  these  necessary  subti*actions  before 

inversion  than  to  transform  the  inverse  matrix.  In  addition,  the  constants 

obtained  from  the  direct  solution  of  the  reduced  matrix  will  be  in  the  form 

desired,  in  most  cases,  i.e.,  (1  is  estimated  directly  rather  than  as  p,  +  ip 

+  Bq,  the  a^   are  estimated  directly  rather  than  as  a^^  -  a—,  and  the  $4  are  es- 
timated directly  rather  than  as  Bj  -  Bq, 

When  the  restrictions  are  imposed  that  2ai«2B^"0  the  coefficients 

of  one  equation  in  the  a±,  say  ap,  and  one  equation  in  the  bj,  say  bq,  must 
be  subtracted  from  other  coefficients  by  columns  and  rows.  The  subtraction 
of  the  a-  coefficients  is  done  only  within  the  a.  columns  of  coefficients  and 
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the  subtraction  of  the  Sq  coefficients  is  done  only  in  the  Bj  coluinns.  After 

completion  of  this  subtraction  by  columns,  the  coefficients  and  RHM  in  the  re- 
sulting Hp  equation  are  subtracted  from  the  corresponding  coefficients  and  the. 
RIHM's  in  the  other  a^   equations,  A  similar  procedure  is  followed  for  the  bq 

equation  coefficients.  After  these  subtractions  are  completed  there  will  re- 
.  main  1+p-l+q-lorp+q-l  symmetrical  equations.  The  number  of  re- 
■  maining  equations  is  also  the  number  of  degrees  of  freedom  among  the  a^ ,  among 

the  Sj  and  one  additional  for  [i» 

Inversion  of  the  Reduced  Matrix  and  Solution  of  the  Equations 

A  least-squares  matrix  for  the  two-way  classification  can  usually  be  in- 
yerted  without  difficulty  using  ordinary  procedures  and  in  single  precision 
arithmetic  (i.e.,  using  7  or  8  significant  digit  arithmetic)  unless  con- 
founding of  the  effects  is  complete  in  one  or  more  cases.  With  experience  it 
becomes  easy  to  recognize  complete  confounding  of  an  a^^  and  a  b^  effect  (if 
such  should  exist)  from  the  n^j  section  of  the  original  least-squares  equa- 
tions* 

The  reduced  set  of  least-squares  equations  can  usually  be  solved 
directly  to  obtain  estimates  of  the  constants,  \i,   a^,   and  b4,  at  the  same 
time  the  inversion  is  being  made.  The  ap  and  6q  constants  are  obtained  from 

ap  =  -Z  a^ 
^    i 

bq  -  -5  Bj 

and  the  inverse  elements  for  the  ap  and  bq  columns  and  rows  may  be  obtained  in 
&  manner  similar  to  that  explained  in  the  first  section* 

Computing  Sums  of  squares  for  the  Analysis  of  Variance 
The  total  reduction  in  sum  of  squares  is 

p-1  q-1 

R(H,  ai,  bj)  -  ^  T..  -f  2  a^  (Ti.  -  Y.  )+  2  £.  (T..^  -  Y.  ) 

i-1  *   j=l  "^    -^     ^ 

and  the  error  sum  of  squares  is  equal  to 

when  interaction  effects  are  non-existent. 

The  least-squares  or  adjusted  sum  of  squares  for  testing  the  sig- 
nificance of  differences  among  the  A  classes  can  most  easily  be  computed  in 
the  two-way  classification  from  R(p.,  ai,  bj)  -  R{\i,   bj),  where 

-33- 


wgmmmmmmmmmm 

« -._    f\.K\sj       nAnrrtHurtlnn    Ot    ThI*    Itam    lirwi    r.r,na*iti,*»    nnn\irlnl^t    lnrrln<-.Am*nt 


■ 


the  "between"  tmcorrected  svun  of  squares  for  B.  Likewise,  the  least-squares 
sum  of  squares  for  testing  the  significance  of  the  differences  among  the  B 
classes  is  obtained  from 

R(m.>  ajL,  b-j)  -  R(m.,  a^),  where 

the  "between"  uncorrected  sum  of  squares  for  A.  These  sums  of  squares  are 
also  computed  when  variance  components  are  to  be  estimated.  In  this  case, 
coefficients  for  variance  components  in  the  expectation  of  these  sums  of 
squares  must  be  computed.  Methods  for  making  these  computations  are  con- 
sidered in  a  later  sub-section. 

In  the  two-way  classification  analvsis,  the  least-squares  sum  of  squares 
for  the  interaction  of  A  and  B  can  be  computed  indirectly  even  though  the 
interaction  constants  are  not  fitted.  This  is  true  because  the  sum  of  squares 
for  interaction  is  equal  to  Rni,aj^,b-j,(ab)j^ /]  -  R(M.,aj[^,b-<)  and 


E[(i,ai,bj.(ab)i£|  -|Z§^, 


the  "between"  uncorrected  sum  of  squares  for  the  AB  subclasses.  The  same  is 
true  in  least-squares  analyses  with  more  complex  models.  That  is,  when  con- 
stants for  all  effects  among  the  subclasses  are  fitted,  except  the  highest 
order  interaction,  the  sum  of  squares  for  this  interaction  can  be  obtained 
indirectly  from  the  difference  between  the  subclass  sum  of  squares  and  the 
totJtl  reduction  due  to  fitting  constants.  A  similar  indirect  procedure  can 
be  used  to  compute  the  interaction  when  partial  regressions  for  continuous 
variates  are  also  fitted  at  the  same  time,  provided  constants  for  all  effects 
among  the  subclasses  have  been  fitted  except  the  highest  order  interaction. 

The  error  sum  of  squares  for  testing  the  significance  of  interaction 
effects  is 

2ZZ  J.  5^-  -  ZZ  -li 
ijk  ^J^   ij  nij 

the  "within"  subclass  sum  of  squares. 

When  the  interaction  effects  are  found  to  be  significant,  by  means  of  the 
test  of  significance  in  the  analvsis  of  variance,  the  main  effects  and  the 
corresponding  sums  of  squares  are  biased.  This  is  true  because  of  the 
partial  confounding  of  the  main  effects  and  the  interaction  effects  due  to 
unequal  subclass  numbers.  If  interaction  effects  exist  the  main  effects  must 
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therefore  be  estimated  from  averaging  over  the  subclass  meaiiS  rather  than 
from  the  class  means.  When  interaction  effects  exist  in  the  two-way  classifi* 
cation  and  all  subclasses  are  filled,  the  weighted  squares  of  means  procedure, 
described  in  many  textbooks  on  statistics,  may  be  used  to  compute  the  sums  of 
squares  for  the  main  effects  adjusted  for  the  interaction.  The  weighted 
squares  of  means  analysis  is  equivalent  to  the  complete  least-squares  analysis 
when  constants  have  been  fitted  for  all  main  effects  and  interactions.  This 
will  be  shown  to  be  the  case  in  the  next  section  >rtiere  consideration  is  given 
to  the  fitting  of  interaction  constants. 

When  all  subclasses  of  A  and  B  are  not  filled,  the  degrees  of  freedom  for 
the  interaction  sum  of  squares  must  be  obtained  by  difference  rather  than  from 
(p-l)(q-l).  The  number  of  degrees  of  freedom  for  interaction  in  any  case  is 
equal  to  the  number  of  A3  subclasses  minus  the  number  of  A  classes  minus  the 
number  of  B  classes  plus  one. 

Standard  Errors,  Orthogonal  Comparisons  and  Tteaii  Separation  Procedures 

Detail  procedures  for  computing  standard  errors  of  constant  estimates 
and  linear  functions  of  constants  estimates  are  the  same  as  those  described 
in  the  previous  sections  for  the  one-way  classification.  Likewise,  methods 
for  testing  the  significance  of  pairwise  differences  among  least-squares 
means  and  orthogonal  contrasts  as  used  in  the  one-way  classification  are  also 
applicable  to  the  two-way  classification.  In  either  case  the  inverse  of  the 
variance-ccvariance  matrix  is  required. 

Estimation  of  Variance  Components 

When  the  a^   and  the  %*   are  random  samples  from  populations  of  these 
effects,  the  estimates  of  the  variance  components  associated  with  these 
effects  are  of  primary  interest.  These  variance  component  estimates  are  ob- 
tained by  equating  the  least-squares  sums  of  squares  or  mean  squares  to  their 
corresponding  expectations  and  solving  the  resulting  equations.  In  the  two- 
way  classification  without  interaction,  the  analysis  of  variance  is  fis 
follows : 


Source  of  Variation    d.f.        S.Sqs.  E(MS) 

A  p-1   R(^,ai,bj)  -  R(n,bj)  a=  +  k2a| 

B  q-1       R(ji,ai,bp  -  R(^,ai)     a|  +  k^ag 

Error  n,.-p-q+l  2Z2  yfjj^  -  R(^ji,ai,bj)  a| 

The  coefficients  of  the  variance  components  in  the  expectation  of  the  mean 
squares,  E(MS),  may  be  computed  with  either  of  two  methods.  One  method  (the 
first  to  be  described)  may  be  regarded  as  the  direct  method  find  the  other  as 
an  indirect  inethod.  The  direct  method  of  computing  the  coefficients  of  var- 
iance components,  the  k's,  parallels  the  direct  method  of  computing  the  suras 
of  squares  and  the  indirect  method  of  computing  these  coefficients  parallels 
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the  indirect  method  of  computing  the  sums  of  squares. 

It  will  be  recalled  that  any  least-squares  sum  of  squares  may  be  comouted 
directly  from  the  matrix  multiplication  B'Z-1b,  where  B'is  a  row  vector  of  a 
set  of  constants,  Z-1  is  the  inverse  of  the  square  symmetrical  segment  of  the 
variance-covariance  inverse  corresponding  to  this  set  of  constants  and  B  is  a 
column  vector  of  the  constants.  This  is  true  for  the  two-way  classification 
as  well  as  for  any  other  least-squares  analysis.  However,  in  this  case  the 
sums  of  squares  for  A  and  B  can  most  easily  be  computed  indirectly,  i.e., 
from  differences  in  reductions  in  sums  of  squares  as  indicated  above,  ■ 

Direct  Method  of  Computing  the  k's 

If  the  restrictions  are  imposed  that  2  ai  =■  2  bj  =  0  in  the  original 

equations  and  the  least-squares  sums  of  squares  are  confuted  from  B'Z-^B,  the 
variance  component  coefficients  k^  and  k2  are  easily  computed  as  follows: 


<^   p  i  ^    p-1  ii  A 


The  superscripts  on  Z  identify  the  elements  in  the  matrix  inverse  to  the 
square  symmetrical  segment  from  the  variance-covariance  inverse  matrix  and 
the  subscripts  A  or  B  identify  the  segments  for  the  A  effects  and  the  B 
effects. 

Extension  of  the  direct  method  of  comouting  the  coefficient  for  the  major 
variance  component  in  each  line  in  the  E(MS)  for  more  complex  designs  is 
straight-forward.  In  general,  the  formula  for  computing  these  coefficients  is 

k  =i  (ZZ^i  -_1_  Z2Z^^), 
^    i     d.f.  ij    ' 

where  d.f.  is  the  number  of  degrees  of  freedom  for  that  line  in  the  analysis 
of  variance  and  m  is  the  number  of  classes  or  subclasses. 

Indirect  Method  of  Computing  the  k's  li/ 

The  indirect  method  is  accomplished  by  first  computing  the  coefficient 
for  the  variance  component  in  the  reduction  in  sum  of  squares  due  to  fitting 
all  effects  except  the  set  being  considered.  This  coefficient  is  then  sub- 
tracted from  n. ..  and  the  result  is  divided  by  the  degrees  of  freedom  for  that 
source  of  variation  to  obtain  the  coefficient  for  the  variance  comoonent.  The 
problem  consists  of  computing  the  coefficient  for  the  variance  component  in 
the  appropriate  reduction  in  sum  of  squares.  This  is  accomplished  by 

u/  Henderson,  C,  R, Estimation  of  variance  and  covariance  comoonents. 
Biometrics  9'-   226-252.  1953. 
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computing  the  sum  of  crossproducts  between  corresponding  elements  of  two 
square  matrices. 

The  inverse  of  the  variance-covariance  matrix  when  all  effects  except  the 
set  for  which  the  variance  component  coefficient  is  to  be  computed,  are  in- 
cluded in  the  model  is  one  of  the  matrices  required  for  use  of  this  indirect 
method.  For  example,  in  the  two-way  classification  analysis,  if  the  co- 
efficient for  Ga   is  desired,  then  the  inverse  of  the  variance-covariance 
matrix  for  the  the  model  y-j^  °  P-  ■*"  ^j  "•"  ejk  ^^  required.  The  "associated 
sums"  matrix  is  the  second  square  matrix  required  for  use  of  this  indirect 
method  and  is  computed  from  the  matrix  multiplication  N  N',  where  N  is  the 
segment  of  the  original  complete  set  of  least-squares  equations  which  contains 
the  coefficients  associating  the  effects  under  consideration  with  all  others 
in  the  model.  For  example,  in  the  two-way  classification,  N  is  the  n^j 
section  of  the  least-squares  equations  to  the  right  of  the  main  diagonal  and 
N'  is  the  nj^  j  section  to  the  left  of  the  main  diagonal.   If  the  elements  in 
the  inverse  matrix  for  the  least-squares  equations  for  the  reduced  model  are 
RiJ  and  the  elements  in  the  product  matrix,  N  N',  are  %-j,  the  coefficient 
for  a  variance  component  in  the  E(MS)  of  the  analysis  of  variance  is  given  by 

n..  -  iJ 
degrees  of  freedom 

The  quantity 

2Z  R^J  %. 
ij       ^ 

may  also  be  computed  from  the  sum  of  the  diagonals  in  the  matrix  resulting 
from  the  multiplication 

N'R"%  ,   •• 

where  R~^  is  the  inverse  of  the  variance-covariance  matrix  for  the  reduced 

model,  I 

The  indirect  method  of  computing  the  coefficients  for  variance  com- 
ponents is  more  easily  completed  than  the  direct  method  for  the  two-way  K^ 
classification  analysis,  as  was  the  case  in  computing  the  sums  of  squares  for         ^ 
tests  of  significance.  The  indirect  method  of  confuting  the  coefficients  for 
a^  and  o^  reduces  to 

1  f^^ 

1   q=I       i  ni^  '  I 


1  ,       .f^, 


4r/ 


kp  -  T-T  (n..  -  2  


for  the  two-way  classification  without  interaction, 
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Absorption  of  the  \i+b.^   Equation 

Although  the  inverse  of  the  variance-covariance  coefficient  matrix  for 
the  complete  nodel  is  required,  it  can  often  be  obtained  more  easily  by  a 
partitioning  procedure  than  from  direct  inversion  of  the  reduced  set  of  least- 
squares  equations.  The  absorption  of  the  \i-*-a±   equations  into  the  b<  equations 
is  only  one  step  in  this  partitioning  procedure. 

It  was  shown  in  the  one-way  classification  that  the  equations  for  \i+a± 
are  identical  with  the  equations  for  the  a^.  When  p,  is  combined  with  the  a^ 
it  is  unnecessary  to  impose  a  restriction  on  the  ai  since  there  are  p  'degrees 
of  freedom  associated  with  the  ii+a^  but  only  p-1  degrees  of  freedom  associated 
with  the  ai  alone.  This  being  true,  and  with 

1^  +  ai  .  ^  (Yi.  -  2  nijbj) 

it  is  fairly  easy  to  absorb  the  equations  for  n  +  ai  into  the  equations  for 
the  bj. 

If  the  new  coefficients  for  the  bj,  after  absorption  of  the  ji  +  ai,  are 
C(bjb4)  and  C(bjbjt)  and  the  new  right  hand  members  are  S(bj),  the  reduced 
equations  are  as  follows: 


m — 
t>l 


^ 


RHM 


b^:  C(bibi)  C(bib2)  C(bib3)  ...  C(bibq)  S(bi) 
bg:  C(b2bi)  C(b2b2)  C(b2b3)  ...  C(b2bq)  S(b2) 
b3:  C(b3bi)  C(b3b2)  C(b3b3)  ...  «(b3bq)  S(b3) 


... 
... 


bq:  C(bqbi)  C(bqb2)   C(bqb3) 


C(bqbq)   S(bq) 


The  new  coefficients  and  RHM's  are  computed  as  follows: 


2 

C(bjbj)-n.j.zg 


\ 


C(bjbj»)  -  C(bj,bj)  —2    ni.         »  and 


S(b.)  -  T..  -  2  ^H^ULl 


-yt^ 
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Unless  the  number  of  p,  +  aj^  equations  is  large  and /or  the  number  of  b^ 

equations  is  large  this  absorption  can  be  completed  on  an  electric  desk  cal- 
cxolator.  Programs  for  completing  this  absorotion  are  now  available  for  high 
speed  electronic  computers  which  permit  a  wide  variation  in  p  and  q. 

No  restriction  needs  to  be  imposed  on  the  least-squares  equations  until 
•  after  absorption  has  been  completed.  After  absorption  it  will  be  noted  that 
the  sum  of  the  new  coefficients  for  the  bj  sum  to  zero  by  rows  and  columns 
and  that  the.  sum  of  the  RHM's,  by  column,  sum  to  zero.   This  provides  a  check 
■  on  the  absorption  process,  but  does  show  that  dependencies  exist  in  the  re- 
duced set  of  equations.   The  dependencies  are  removed  bv  imroosing  the  re- 
striction that  Z  Sj  =0,  whereby  the  coefficients  for  Sq  are  subtracted  from 

the  coefficients  of   the  other  bj  and  the  resulting  coefficients  in  the  Sq 
equation  are  then  subtracted  from  the  resulting  coefficients  in  the  other  bj 
equations.  The  S(bg)  is  also  subtracted  from  the  other  S(bj)  in  each  of  the 
RHM  columns. 

The  inverse  of  the  reduced  set  of  equations  for  the  bj,  after  absorption 
of  the  jJ.  +  ai  and  after  imposing  the  appropriate  restriction,  is  exactly  the 
same  as  the  inverse  elements  for  the  ^^,$4  segment  in  the  inverse  matrix  of 
the  complete  variance-covariance  matrix.  The  constants  obtained  for  b^  are 
also  identical  to  the  bj  constants  that  would  be  obtained  by  solving  the 
original  set  of  equations.  In  effect,  the  constants  for  the  b^  are  fitted 
on  a  within  A  class  basis  in  either  case. 

The  reduction  in  sum  of  squares  obtained  bv  multiplving  the  S^  by  the 
S(b-j)  -  S(bq)  is  equal  to  the  least-squares  sum  of  squares  for  B,  i.e., 

^-^  /.   r  1 

S.Sqs.  B  .  2  bj  Is(bj)  -  S(bq)J 

«  2  b^  S(b^) 


J 


J  "^°d' 


-  R(pi*ai,bj)  -  R({i,ai), 

Estimates  of  the  \i  +  a±,   the  least-squares  means  for  the' A  classes,   are 
obtained  by  simply  solving  the  equations  for  {x  +  a^  after  computing  the  Sj 
constants.     For  exainple. 


and 


^1-^(^1.  -  Znijbj) 
^i  -  ^  (Yi.   -  2  Hijbj) 
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The  remaining  inverse  elements  for  the  complete  variance-covariance 
inverse  may  be  obtained  if  desired  from  a  relatively  simple  but  general  matrix 
multiplication  procedure.   If  C  is  the  inverse  matrix  of  the  set  of  equations 
remaining  after  absorption  and  after  appropriate  restrictions  have  been  im- 
posed, D   is  tlie  inverse  of  the  diagonal  matrix  for  the  classes  (ji  +  a.)  or 
subclasses  absorbed  and  N^  is  the  mod-lfied  matrix  of  off-diagonal  coefficients 

in  the  original  least-squares  equations  which  associate  the  effects  being  ab- 
sorbed with  the  remaining  effects,  then  the  inverse  elements  for  the  n  - 

-1  ^ 

section,  (G),  are  obtained  from  -D  R^C.   In  the  two  way  classification  with- 
out interaction,  this  is 


G  -  - 


^1. 
0 


0 
^2. 


•  •  • 

•  •  • 


P.J 


'^ll  "12 


°21  "22 


n 


l(q-l) 


n 


2(q-l) 


n  ,  n  ^ 
pi   p2 


p(q-l) 


.11 


.21 


.12 


.22 


.Kq-l) 


.2(q-l) 


•  •  • 

•  •  • 


c(q-i)i  c<*i-i)2 


,(q-l)(q-l) 


where  the  n'   are  obtained  from  the  subtraction  of  the  last  column  of  the  n . . 
from  each  of  the  other  n.   columns,  ice,. 


-  n, 


ij    ij   "iq 

If  D  was  set  equal  to  zero  to  obtain  C,  then  the  last  column  of  the  n. . 
q  '  ij 

can  be  deleted  rather  than  subtracted  out  as  indicated  abovee   However,  in  this 
case  the  inverse  elements  obtained  do  not  apply  directly  to  the  constants  as 
shown  in  the  model  (3).  Also,  the  constants  which  are  thus  obtained  directly 
are  linear  functions  of  |i,  a.,  and  b  rather  than  the  constants  themselves* 


from 


The  inverse  elements  for  the  ^  +  a.  sqiiarc  segment  (A)  are  then  obtained 


A  -  D 


-1 


(I  -  N^G') 


where  I  is  the  identity  or  unit  matrix  and  G'  is  the  transpose  of  Go 

Since  D  is  a  diagonal  matrix  in  the  absorption  of  a  set  of  equations,  the 
matrix  multiplication  procedures  required  to  complete  the  inverse  can  be  done 
rapidly  -  -  even  on  a  desk  calculator  in  many  cases.   In  addition,  the  use  of 
this  method  to  obtain  the  complete  variance-covariance  inverse  often  results  in 
less  accumulation  of  rounding  errors.   It  is  also  possible  with  this  pro- 
cedure to  selectively  compute  the  inverse  for  the  n  +  a.  when  only  certain  of 

these  means  are  to  be  comparedo   If  desired,  the  diagonal  elements  for  the 
^  +  a.  can  be  computed  without  computing  the  off-diagonal  delments.  This  can 
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often  be  accomplished  without  much  effort  even  though  there  are  several 
hundred  \i  +  a^^   being  estimated. 

When  the  ai  effects  are  random  it  is  often  desirable  to  obtain  the 
maximum  likelihood  estimate  for  the  A  classes.   Henderson  (19Ij9)  1/  has  shown 
that  the  maximum  likelihood  estimates  can  be  obtained  by  regressing  the  least- 
squares  means.  The  amount  by  which  the  least-squares  mean  must  be  regressed 
is  a  function  of  the  corresponding  diagonal  inverse  element,  i».e., 


p,  +  a 


^2 

^a 


^+Aiia, 


T? 


(p.  +  ai  -  \i) 


|A 


<7a 


Si- 


Aii( 


(ai) 


where  (i  +  Ij    is  the  regressed  least-squares  mean,   i.e.,   the  maximum  likelihood 
estimate,   and  the  A^i  are  the  diagonal  inverse  elements  for  the  p.  +  ai^ 
Henderson  shows  that  the  maximum  likelihood  estimate  reduces  to 


p,  +  ai 


,"^  ,      (ai) 
l+(n-l)r 


for  the  one-way  classification,  where  n  is  the  number  of  observations  in  the 
i"th  A  class  and  r  is  the  simple  Intraclass  correlation  or  repeatability.  The 
reason  why  these  formulas  are  only  approximations  of  the  maximum  likelihood 
estimates,  rather  than  the  exact  formulas,  is  because  p,  and  the  a^'s  must  be 
known  without  error  to  obtain  the  maximum  likelihood  estimate.   Since  this  is 
impossible  in  practice,  the  estimates  of  p.  and  the  a^'s  are  used  to  obtain  the 
"best"  unbiased  estimate. 


Numerical  Example 

The  same  data  used  for  the  numerical  example  in  the  two  previous  sections 
will  be  used  here  by  rearranging  them  as  follows: 


2/   Henderson,  C.  R,  Estimation  of  general,  specific  and  maternal  combining 
abilities  in  crosses  among  inbred  lines  of  swine.  Unpublished  Ph.D.  thesis. 
Iowa  State  College  Library.  199  p.p. 
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Ration 

Pig 

9ire 

No. 

No. 

No. 

1 

2 

3 

1 

1 

5 

2 

3 

2 

6 

3 

. 

3 

. 

5 

. 

h 

_ 

6 

. 

5 

- 

7 

- 

Sub-totals 

11 

23 

3 

2 

1 

2 

8 

U 

2 

3 

8 

h 

3 

_ 

9 

6 

h 

_ 

_ 

6 

5 

- 

- 

7 

Sub-totals 

5 

25 

27 

Totals 

l6 

16 

30 

Means 

h 

6 

5 

1)  Mathematical  model: 

yijk  -  M.  +  Si  +  r^  +  ei-jk 
i  -  1,  2,  3 

k  «  1,  2,  ..•,  Hij 
where : 

yijk  "  'the  gain  of  the  k^^  barrow  on  the  j"th  ration  by  the  ith  sire^ 
|ji  «  the  overall  mean  with  equal  subclass  frequencies. 
Si  -  effect  of  the  i"*^^  sire, 
Tj  -   effect  of  the  j"^^  ration, 
®ijk  "  random  errors, 

2)  Least-squares  equations: 

p.     Si     S2     S3     Ti     T2       RHM 


ji: 

18 

h 

8 

6 

8 

10 

9h 

si: 

U 

h 

0 

0 

2 

2 

16 

82" 

8 

0 

8 

0 

5 

3 

18 

S3: 

6 

0 

0 

6 

1 

5 

30 

ri: 

8 

2 

5 

1 

8 

0 

37 

ra: 

10 

2 

3 

5 

0 

10 

57 

^  i 
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It  will  be  noted  that  a  number  of  dependencies  exist  in  these  equations. 
For  example,  in  the  ij,  equation  the  sum  of  the  coefficients  for  s^   equals 
the  sum  of  the  coefficients  for  the  r^,  the  overall  number  of  obserp-ations, 
Likewise,  the  sum  of  the  right-hand  members  for  the  si  and  the  rj  equa- 
tions each  equal  the  right-hand  member  for  the  \i   equation.  Hence,  the 
determinant  of  the  coefficient  matrix  is  zero  and  no  imique  solution  or 
inversion  is  possible  without  imposing  some  restrictions  on  the  equations. 


3)  Imposing  the  restrictions  that  Z  s^°   2  ta 


Or 


a  %  S2  ?i 


EHM 


SIS 

=2- 
^1- 


18-2  2-2  9h 

-2  10  6   It  -lU 

2   6  Ih   6  18 

-2  h  6  18  -20 


U)     Inverse  of  the  reduced  least-squares  coefficient  matrix  and  constant 
estimates: 

The  matrix  inverse  to  the  reduced  coefficient  matrix  is  as  follows: 


SI 


^t! 


.061186   .021818  -.022160   .009363 


s^:  ,02l8li8  .1U1819  -.06ll7h  -.009363 
S2:  -.022160  -.06ll7h  .1120U7  -.02621? 
ri:         .009363     -.009363     -.026217       »067U16 


(h) 


The  constant  estimates  obtained  by  multiplying  the  inverse  and  RHM's  of 
the  reduced  matrix  together  are 

ft  »  1.8876     S2  -  1.311j6 

SI  «  -.8876     ri  »  -.8090 

Estimates  of  S3  and  r2  are  then  computed  as  follows: 

S3  -  -(-.8876  -»■  1.3116) 

-  -.1270 
P2  -  -(-.8090)   »  .8090. 

5)  Sums  of  squares  for  the  analysis  of  variance: 

R(n,si,rj)  -  (li.8876)(9lj)  +  (-.8876)(-ll)  +  (1.311i6)(l8)  ^   (-.8090)(-20) 

-  511.7036. 
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The  total  uncorrected  sum  of  squares  for  7ijk  is  22Zy^„    =  568. 
Error  S.Sqs.   »  ^8  -  511.7036  =  56.2961j  with  no  interaction. 

S.Sqs.   Sires(S)   =  R(|i,Sj^,rJ  -  2  -^ 

J        J 
'  ai.7036  -  li96.0250  =  15.6786 


Yf. 


S.Sqs.  Rations(R)   »  R|<u,»4,rj)   -  Z  -il 

*^         i  ni. 


»=  511.7036  -  502.0000  »  9.7036 

S.Sqs,  Interaction  (SR)   »  ZZ  -i^  -  R(M.,s^,rJ 

ij  "ij  -^ 

»  5iil.9333  -  511.7036  =  30.2297 

S.Sqs.  Within  SR  Subclasses  =  568  -  5Iil.9333 

=  26.0667 

The  analysis  of  variance  is  as  follows: 

Source  of  Variation  d.f.  S.Sqs .£/     M.S.     F 

Sires (S)  2  15.6786  7.8393 

Rations(R)  '  1  9.7036  9.7036 

SR  2  30.2297  I5.11b8  6.96^ 

Within  Subclasses  12  26.0667  2.1722 

1/  The  sums  of  squares  for  sires  and  rations  are 
unadjusted  for  the  interaction. 

Since  the  test  of  significance  for  interaction  is  significant  at  the  .01 
level  of  probability,  the  least-squares  analvsis  should  be  comoleted  with 
constants  being  fitted  for  the  interaction  effects  as  well  as  the  main 
effects.  When  all  subclasses  are  filled, 'the  aporooria te  least-squares 
analysis  can  best  be  comnleted  for  the  two-wav  classification  by  the 
method  of  weighted  squares  of  means.  A  discussion  and  the  presentation 
of  methods  of  analvsis  when  the  interaction  must  be  considered  is  de- 
ferred until  the  next  section.  The  analvsis  of  these  data  will  proceed 
as  though  no  interaction  effects  existed. 

With  no  interaction  effects  considered  the  analysis  of  variance  is 
as  follows: 


Source  of  Variation  d.f.   S.Sqs.    M.S. 

Sires(S) 
Rations (R) 
Error 


2 

15.6786 

7.6393 

1.95  n.s. 

1 

9.7036 

9.7036 

2. hi  n.s. 

Ih 

56.296ii 

1.0212 

-hh- 


I 
I 
I 


6)  Standard  errors  and  individual  comparisons: 
(a)  Standard  errors  of  least  squares  means, 


sr,  -  V  (.061L86)(L.0212) 


.<0 


sp.+si  «  v/   E06II186  +  .1) lb 81 9  +  (2)(.0218L8)]  (I1.0212) 

=  1.00. 
s^+92  "  /' [.0611186  +   .II20L7  +   (2)  (-. 022160 5j  (h.0212) 

=  .72. 
s^+%^  «  y/   [.06ll|86  +  .13h5l8  +  (2) (.0003123  (h.0212) 

»  .39. 
s^+?^  =  v/  [.O6II186  +  .067L16  +  (2) (.009363))  (1.0212) 

=  .77 

s^+r2  =  \/  [I061Ii86  +  .067L'l6  +  (2) (-.009363)]  (Ii. 0212) 
»  .67 
(b)     Individual  comparisons  among  the  Sj^. 

(i)     Mean  separation  with  Duncan's  Multiole  Range  Test   (.05  level). 


Product 


ComDarison  yi-y-i   •/ C^^+GJJ-2CiJ  Differences  Cq   Zp,n2 


S2  vs.   S3 

1.7U16 

2.396 

h.l7 

6.08 

S2  vs.   sx 

2.2292 

2.297 

5.12 

6.38 

S3  vs.   s^ 

.Ia606 

2.116 

.97 

6.08 

These  procedures  are  carried  out  on  these  data  only  for  the  purpose 
of  showing  the  computations  involved.  In  practice,  if  the  F  test  in 
the  analvsis  of  variance  is  non-significant,  it  is  doubtful  that  one 
would  want  to  use  a  mean  separation  procedure, 

(ii)  Orthogonal  comoarisons  among  the  s^. 

The  comoutational  orocedures  involved  in  comrouting  the 
least-squares  sums  of  squares  for  orthogonal  contrasts  among 
a  set  of  constants  were  illustrated  in  the  two  previous 
sections  with  the  numerical  example.  Since  this  orocedure  is 
always  the  same  regardless  of  the  other  constants  being  fitted 
in  the  least-squares  analvsis,  it  will  not  be  chown  here. 
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7)  Estimation  of  variance  components. 


The  number  of  degrees  of  freedom  for  sires  and  rations  are  in- 
adequate, of  course,  to  give  accurate  estiirntes  of  variance  components 
for  sires  or  for  rations.  However,  in  order  to  illustrate  the  computa- 
tional procedures,  which  are  directly  applicable  to  analyses  where  the 
number  of  degrees  of  freedom  are  adequate,  the  effects  of  sir^s  and 
rations  will  be  regarded  as  random  and  estimates  of  a|  and  Or  will  be 
obtained. 

The  expectation  of  the  mean  square  for  rations  is  ae  +  ki   a^,,  and  the 
exoectation  of  the  mean  square  for  sires  is  a|  +  k2c|  when  the  inter- 
action does  not  exist.  Since  the  mean  square  for  error  contains  only  Oq 
the  problem  is  to  compute  k^  and  k2.  As  pointed  out  above,  these  k  values 
may  be  comouted  by  either  the  direct  method  or  the  indirect  method.  Al- 
though the  indirect  method  is  the  easier  to  use  in  the  two-way  classi- 
fication both  methods  will  be  oresented  here.  Extension  of  the  direct 
method  to  more  complex  analyses  where  it  is  more  useful  than  the  indirect 
method  is  straight-forward. 

(a)  Direct  method  of  computing  the  k's. 

The  inverse  matrices  to  the  Z  segments  of  the  complete  matirix 
inverse  (Ii)  are  as  follows: 


'^  ■  [: 


II1L8I9  -.061171 
06117U  .1120li7 
-1 


_-l 


8.975050       h.90008li 


U.90008h     11.600112 


Z^  -    [:067lil6]  ^     -  [lli. 833271] 
The  coefficients,  k-^  and  k2,  are  now  computed  as   follows: 

ki  -  I  (lh.83327U)  -  7.117 

kg  -  -[8.875050  +  11.600112  -  I  (I.9OOO8I  +  li.900081i)] 

-i  (15.675078)  =  5,225 
3 

The  sums  of  squares  for  sires  and  rations  may  now  be  verified  from 

S.Sqs.  Sires  =  (-.8876   1.31li6)Zs^B 

-  (-1,5216  10.9002)  B         :  -;   --: 

-  15.683        -.5    I 

S.Sqs.  Rations  -  (-.8090)Zg^  B   "^    -    .  - 17;;  "7 v  .J; 


M 


£,. 
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I 


»  (-12.0001)  B 

«=  9.708 

These  check,  within  rounding  errors,  with  the  s\ms   of  squares  ob- 
tained with  the  indirect  procedure. 

(b)  Indirect  method  of  comouting  the  k's. 

Step  1.   Computation  of  the  matrix  inverses  to  the  reduced, 
matrices. 

The  inverse  of  the  coefficient  matrix  for  p,  +  s^  is 

H+S]^  IJ-+S2  n+s^ 


11+52? 

^1+53: 


1 

r 
0 

0 


0 


0 


1 

8 
0 


0 

1 

z 


RiJ. 


The  inverse  of  the  coefficient  matrix  for  jl+rj  is 

0 


1 

8 

0   1- 

10 


RiJ. 


Step  2.   Computation  of  the  "associated  sums"  matrices,  NN' 
Rations  associated  with  sires: 


Nij 


N  N» 


\  I  [' '  =] 


N. 


ij 


N  Ni 


"8 
IF 

16 
3h 

12 
20 

12 

20 

26 

2 

^  i 

r* 

2 

3  5 

3D 

21 

*- 

2h 

ii 

• 

2     2' 

5    3 
1    5 
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step  3«      Coinputing  the  sura  of  products  of  Rij  and  Nj^4, 
Sires: 

22  R^^  % ^  -  10.583. 
ij  '^ 

Rations: 

22  bM  N^  ^  -  7.550. 
ij  "^ 

Step  U.     Computing  the  k  values. 

ki  -  i2^ip2  -  7.117. 

kj  .  1877-550    .5.225 

These  k  values  check  exactly  with  those  computed  by  the 
direct  method.  With  the  two-way  classification  this  in- 
direct method  reduces  to  the  following: 

k-,  -  18  -  (^>'-^(2>'  .  (^)'-^(3)"  .  (1)^-H(5)" 

-  7.U17, 

ko  -  i  Fib     (2)^-^(^)^-^(i)^     l2)f+Ol!il51fl 
^      2  L     "  8  10  -J 

-  5.225 

(c)  Computing  the  variance  component  estimates. 

The  analysis  of  variance  with  the  expectations  of  the  mean 
squares  is  given  below: 

Source  of  Variation  d.f.   S.Sqs.   M.S>     E(MS) 
Sires (S)  2   15.6786  7.8393  a|+5.225a| 

Rations  (R)         1    9.7036  9.7036  a|+7.1il7a| 


Error  lU   56.2961  L.0212  a, 

^2  .  9.7036  -  [..0212  .  ^755 
r  7.[ii7 

;;=  -  7.8393  -  1.0212  _     -.^ 


2 
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8)     Absorption  of  the  p.  +  a^  equations. 

(a)     Calculation  of  the  new  or  adjusted  coefficients  for  the  r^. 

The  calculations  involved  in  the  absorption  of  the  p.  +  aj^ 
equations  may  be  obtained  from  the  following  table: 


"i.  ^1 


T-,  To  RHM 


p.  +  s^:  Ij  2(. 50000)  2(. 50000)  l6([..0OOO0) 
\i  •*■  S2''  8  5(. 62500)  3 (.37500)  18(6.00000) 
p  +  S3:   6   1(.16667)  5(. 83333)  30(5.00000) 

The  values  in  parentheses  above  are  the  numbers  just  outside  the 
parentheses  divided  by  ni.,  the  diagonal  coefficient  for  the  p  +  Sj^ 

equation.  The  calculation  of  adjustments  which  must  be  made  in  the 
coefficients  for  the  rj  and  in  the  RHM' s  for  the  rj  equations  are  now 
comoleted  as  follows: 

C(riri)  =  8  -(2)(. 50000)  -(5)(. 62500)  -(l)(.l6667)  =  3.70833. 

C(rir2)  =  C(r2ri) 

-  -(2)(. 50000)  -(5)(. 37500)  -(1)(. 83333) 

-  -(.50000) (2)  -(3) (.62500)  -(5) (.16667)  =  -3.70833. 
C(r2r2)  =  10  -(2)(. 50000)  -(3)(. 37500)  -(5)(. 83333)  -  3.70835. 

S(ri)  -  37  -(2)(h. 00000)  -(5)(6.00000)  -(1)(5.00000) 

-  37  -(.50000)(16)  -(.62500)(U8)  -(.16667) (30)  =  -6.00000. 

S(r2)  "  57  -(2)(L. 00000)  -(3)(6.00000)  -(5)(5.00000)  =  6.00000. 

The  reduced  equations  after  absorption  of  the  p,  +  Sj^  equations  are 
as  follows: 


^1 

h 

RHM 

^1* 

rg: 

3.70833 
-3.70833 

-3.70833 

3.70835 

-6.00000 
6.00000 

Except  for  rounding  errors,  the  coefficients  for  r.  sum  to  zero 
by  rows  and  columns  and  the  RHM's  sum  to  zero  by  column, 

(b)  Imposing  the  restriction  that  Z  r-;  =»  0,  comouting  the  r^  and  the 
inverse  of  the  reduced  matrix,  j  *^ 


«U9. 


The  remaining  equation  after  the  subtraction  by  rows  and  columns 
is 

11.8333U  ri  -  -12.00000 

-12.00000 
1  '  lh.8333U 

-  -.8090 
and 

r2  -  -r-L  «  .8090. 

These  are  the  same  values  obtained  for  the  Tj  when  the  solution 
to  all  equations  was  obtained  directly. 

The  inverse  of  the  reduced  matrix,  of  only  order  one  ip  this 
case,  is 

1 


TK:m% 


.067116. 


It  will  be  noted  that  this  is  the  same  inverse  element  obtained  for 
this  position  of  the  matrix  inverse  when  the  inversion  of  the  ji,  »j_ , 
S2  *uid  T^   reduced  matrix  was  completed  simultaneously. 


(c)  Back'  solution  for  the  (1  +  Sj^  constants 

^16  \         /2     2 


i     0     0 


0     0     1 
6 


0     0 


0     i     0 


16.0000 


h9.6l80 
26.761iO 


8 

col 

6 

^  • 

^l.ooooN 

6.2022     • 
b.b607/ 

(d)     Computation  of  the  variance-covariance  inverse  matrix  and 
maxiiraim  likelihood  estimates  for  the  sire  means. 
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The  A  matrix  inverse  applies  directly  to  the  p,  +  s^.  This  matrix 
may  be  transformed,  as  shown  below,  to  give  the  inverse  elements 
which  apply  separately  to  jl  and  the  s^^. 
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.000000  -.011236   .196629 


K» 


-51. 


WKtmmmmimsBfi^gmm 


nsEAnM 


iBagpaaii^ppiijijigig^l^  ""'  ^1-^"  '  mmmm»'t 


B o....r^h  oiirnocAa  Oniv     Heoroducilon  01  ihls  Itsni  tAav  Constltut«  Coovriaiit  Intilna^mmni 


12-1 

1  1  -1  2 
3  1  .1  -1. 


.25pOQa  .117978  .185393 

1  .500000  -.117978  -.185393 

3  -.250000  .26966h  -.219101j 

'  .061L86  .021818  -.022160' 

.021818  .111819  -.061171 

-.022160  -.C6ll7li  .1120L8 


The  transformed  inverse  elements  for  the  t^   column  and  row  are  obtained 
from 


KG  =  1 
3 


111 

2  -1  -1 

-1  2  -1 


0 
-.016851 
-.Ol.Ix9lili 


".009363 
-.009363 
-.026217 

The  variance-covariance  inverse  obtained  by  this  indirect  manner  is  given 
in  the  following  table: 


^ 

Pi 

h 

S2 

h 

\li 

!           .06lli86 

.0218U8 

-.022160 

.009363 

^' 

'.           .02l8h8 

.lUh8l9 

-.06ll7li 

-.009363 

S2! 

!             -.022160 

-.06ll7li 

.1120118 

-.026217 

ri  = 

!                .009363 

-.009363 

-.026217 

.O67I1I6 

It  will  noted  that  these  inverse  elements  check  within  rounding  error  of 
those  obtained  directly.  Of  course,  with  only  four  equations  this  in- 
direct method  is  inefficient.  However,  when  there  are  a  large  number  of 
constants  to  be  fitted  in  one  of  the  two  sets,  the  absorption  procedure 
is  extremely  useful.  In  many  problems,  the  inverse  for  the  section  of 
the  inverse  matrix  involving  the  constants  absorbed  is  not  required.  How- 
.ever,  the  method  of  obtaining  these  inverse  elements  after  absorption  is 
useful  in  some  problems,  particularilv  where  a  large  number  of  constants 
have  been  absorbed  and  tests  of  significance  are  desired  among  only  a 
portion  of  those  depending  on  the  values  obtained  for  jl  +  Sj^,  Also,  only 
the  diagonal  inverse  elements  for  the  {1  +  si-  can  be  computed  by  modifirLng 
this  procedure  slightly.  These  diagonal  inverse  elements  are  required  to 
compute  the  apnroximate  maximum  likelihodd  estimates  of  the  Sj_  effects. 
The  (i  +  Sj_  for  the  numerical  example  are 


»1  +  8]^ 


i  U.8876  + 


.731 


.731  +(. 250000) (h. 0212) 
'  U.8876  +  (.b210)(-.8876) 


(-.8876) 
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.731 


-  li.51 
a  +  82  i  U.8876  -^  731  ^{.I2921h)(ii.0212) 

:  h.8876  +  (.58h5) (1.3116) 

-  5.65 

.731 
a  +  S3  -  1.8876  *.73i.  +(. 196629) (lx.0212)  (-^^270) 

'  -  U.8876  +  (.1)802) (-.1270) 

*  1.68 

The  "best"  estimate  of  the  expected  performance  of  additional  random  pro- 
geny by  each  of  these  three  sires,  vhen  an  equal  number  are  raided  on 
each  of  these  two  rations,  is  given  by  the  approximate  maximum  likelihood 
estimates  above.   The  ranking  of  the  ji  +  Sj^  is  very  likely  to  be  different 
than  that  of  the  jl  +  s£  when  more  numbers  are  available  and  when  more 
variability  in  frequency  occurs  from  subclass-to-subclass.  Hence,  the 
jl  +  s^  are  far  more  useful,  than  are  the  least-squares  means,  the  p.  +  Sj^^ 
in  determining  which  animals  are  to  be  retained  and  which  are  to  be 
culled  in  a.  selection  program. 

TWO-WAT  CLASSIFICATION  WITH  INTERACTION 

If  data  are  classified  in  only  two  ways  and  it  is  necessary  to  consider 
the  interaction  effects,  simplified  methods  are  available  for  completing  the 
least-squares  analysis  unless  one  or  more  of  the  subclasses  are  not  filled. 
For  the  special  case  of  only  two  classes  of  A  or  two  classes  of  B  a  weighting 
method,  described  in  several  books  on  statistical  methods,  provides  a  short- 
cut procedure  for  computing  all  sums  of  squares  in  the  analysis  of  variance. 
VJhen  there  are  more  than  two  classes  for  both  A  and  B  and  all  subclasses  are 
filled,  the  weighted  squares  of  means  method  of  analysis  provides  a  short-cut 
procedure  for  computing  the  least-squares  sums  of  squares-  for  the  main  effects. 
In  this  case,  a  short-cut  procedure  is  also  available  for  computing  the 
variance-covariance  inverse.   This  procedure  will  be  described  and  illus- 
trated with  a  numerical  example  in  this  section. 

When  all  AB  subclasses  are  not  filled  and  the  interaction  must  be  con- 
sidered it  is  necessary  to  use  least-squares  procedures  which  involve  matrix 
arithmetic  in  order  to  obtain  unbiased  estimates  of  the  constants  and  the 
5um^  of  squares  for  tests  of  significance.  This  direct  procedure  also  is 
necessary  if  partial  regression  constants  are  being  fitted  along  with  those 
for  n,  aj^,  bj,  and  the  interaction  constants,  (ab)j^4,  even  though  all  sub- 
classes are  filled.  In  this  latter  case  however,  the  alternative  procedure 
for  computing  the  inverse  of  the  variance-covariance  matrix  that  was  pre- 
sented in  the  section  on  the  one-way  classification  with  covariance  may  be 
useful. 
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The  purpose  of  this  section  is  to  present  tne  general  least-squares- 
analysis  for  data  classified  in  two. ways  when  constants  for  the  interaction 
effects  must  also  be  fitted.   Problems  arising  in  the  least-squares  procedures 
when  interaction  constants  are  being  fitted  are  presented  in  considerable 
detail. 


tfodel 

The  general  linear  mathematical  model  for  the  two-way  classification  with 
interaction  is 

yijk  -  ^t  +  a^  +  bj  +(ab)ij  +  eijk 

i  -  1,  2, ,  p 

j  -  1,  2, ,  q 

k  -  1,  2,- ,  n^j 

Tilk  °  "^^^   ^^^  observation  in  the  j'^^  B  class  and  i'th  A  class, 
VI  «  the  overall  mean  with  equal  subclass  numbers. 


where : 


(ab)ij 


effect  of  the  i"th  A  class, 

effect  of  the  j'th  B  class, 

effect  of  the  ij'^^  AB  subclass  after  the  average  effects  of  A 
and  B  have  been  removed.  These  are  the  individual  interaction 
effects  expressed  as  a  deviation  from  the  mean  p.. 


6^4}^  ■  random  errors.  Assumed  to  be  NID(0,a|), 

The  mathematical  model  is  the  same  regardless  of  whether  the  a^  and/or 
the  bj  are  fixed  or  random  effects.  If  both  are  fixed,  the  interaction 
effects  are  also  fixed.  When  all  effects  except  the  errors  are  fixed  the 
model  is  referred  to  a  Model  I  of  Eisenhart  or  the  fixed  model  and  if  all 
effects  except  \i   are  random  it  is  referred  to  as  Model  II  of  Eisenhart  or  the 
random  model.  The  model  is  regarded  as  mixed  when  one  set  of  effects  (either 
the  a^  or  b-t)  is  fixed  and  the  other  is  random.  In  the  mixed  model  the  inter- 
action effects  are  random.  The  least-squares  procedures  required  to  compute 
the  stun  of  squares  for  the  analvsis  of  variance  are  the  same  regardless  of 
whether  a  set  of  effects  is  random  or  fixed.  With  fixed  effects,  the  least- 
squares  means  with  pairwise  or  orthogonal  comparisons  are  often desired.  On 
the  other  hand,  with  random  effects,  the  investigator  is  interested  in  ob- 
taining unbiased  estimates  of  variance  or  covariance  components.  At  least  a 
portion  of  the  matrix  inverse  to  a  variance-covariance  matrix  is  required  in 
order  to  test  the  significance  of  pairwise  or  orthogonal  comparisons  or  to 
obtain  estimates  of  variance  or  covariance  components.  Where  feasible,  i.e.. 


\ 
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if  the  nuinber  of  equations  is  not  too  large,  the  inverse  of  the  variance- 
covariance  matrix  for  the  conrolete  model  should  be  computed, 

Least^Squares  Equations 

The  least-squares  equations  for  the  two-way  classification  with  inter- 
action are  given  in  tabular  form  below: 


H 

h 

(.^hi 

RHM 

V-' 

n.. 

■4.. 

"•J 

"ij 

T.. 

ait 

^i. 

0"i.  ° 

°1'3 

°i3 

Yi. 

^d' 

°-J 

"IJ 

o-:I,° 

°i3 

^•J 

(ab^j  : 

"id 

°i3 

°ij 

0"i3° 

^id 

This  complete  set  of  equations  contains,  (i)  one  equation  for  \i,    (ii)  one 
equation  for  each  of  the  p  classes  of  A,  (iii)  one  equation  for  each  of  the  q 
classes  of  B,  and  (iv)  one  equation  for  each  of  the  subclasses  of  A  and  B 
which  has  one  or  more  observations.  If  desirable,  however,  constants  for  only 
a  selected  group  of  the  interaction  effects  need  be  fitted.  In  this  case, 
equations  for  only  the  selected  group  of  subclasses  would  be  included  for  the 
interaction  effects. 


Imposing  Restrictions  on  the  Constants 

A  unique  solution  to  the  least-squares  equations  can  not  be  obtained 
until  they  are  reduced  in  number  to  the  number  of  degrees  of  freedom.  Al- 
though numerous  restrictions  may  be  imposed  in  order  to  accomplish  this,  the 
restriction  that  the  constants  for  the  main  effects  sum  to  zero  within  a  set 
and  that  the  constants  for  the  (ab.)^^  sum  to  zero  over  each  row  and  over  each 
column  is  probably  the  roost  satisfactory.  The  linear  mathematical  model  it- 
self suggests  this  set  of  restrictions  since  the  effects  of  aj^,  b4,  (ab)j^j, 

and  the  e±^   are  expressed  as  deviations  from  the  mean  p.. 

Using  the  restriction  which  requires  the  setting  of  one  of  the  constants 
in  each  set  of  main  effects  equal  to  zero  (say,  sL  and  Sq)  suggests,  that  th« 
(at))iq,  the  (al))p-j  and  the  (ab)pq  interaction  constants  could  also  be  set 
equal'  to  zero,  thereby,  allowing  the  deletion  of  all  these  equations.  Al- 
though a  unique  solution  to  the  equations  can  be  obtained  when  this  is  done, 
the  estimates  for  the  constants  are  entirely  unsatisfactory,  since  they  a  re 

|i»  -  ii  +  ap  ■•■  Bq  +  abpq 


^i 


ah^„  .  ab. 


ajL  -  ap  +  abiq  -  abpq 
Sj.fiq 


ab„4  -  abpq 
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(ab)i^ 


(ab)j^j  -  (^'^^iq  -  ^^^^pj  *  ^^^^ 


pq* 


In  addition  to  the  confounding  of  the  constant  estimates  when  using  these 
latter  restrictions  the  total  reduction  in  sura  of  squares  obtained  from 

p-1         q-i         p-1  q-1 

Z  ai  Yi.  +  Z  Si  Y.J  +  Z   Z  (ab).  .  Ti-j 
i"l  ^      j-1  *^      i=l  j=l      ^ 

is  incorrect,  being  biased  upwards.  Also,  the  sums  of  squares  for  the  various 
effects  obtained  from  the  sub-inverse  matrices  and  the  prime  constant  es- 
timates are  incorrect.  Hence,  the  simple  deletion  of  equations  to  remove  de- 
pendencies among  the  least-squares  equations  when  interaction  constants  are 
being  fitted  mav  lead  to  serious  errors. 


By  imposing  the  restrictions  that  Z  ii5_ 


Bj  -  Z(ab)i_j 


3-.   i-  ...   5WiJ_'0 
on  the  prinffi  estimates  of  the  constants  it  is  possible  to  compute  the  a^^,  Sj 
and  (aD)ji^j,  In  addition,  the  inverse  matrix  can  be  transformed  to  give  the 

inverse  that  would  have  been  obtained  had  the  restrictions  that  Z  a-?  =  Z  S^  • 

i      ^     '^ 
Z  (ab)j^j  =  Z(ab)i-j  ■  0  been  imposed  on  the  original  least-squares  equations. 

All  these  manipulations,  however,  require  far  more  work  than  required  in  the 
subtractions  and  additions  within  the  original  equations  to  impose  the  re- 
strictions that  ai  and  Sj  sum  to  zero  and  the  (ab)^^  sum  to  zero  by  rows  and 
columns. 


0  are  im- 


When  the  restrictions  that  Z  a^  »  Z  Bj  ■  Z(at)iJ  =  Z(a1))ij 

posed  on  the  least-squares  equations  it  is  necessary  to  Carry  out  a  number  of 
subtractions  and  additions  within  the  coefficient  matrix  and  the  right  hand 
members.  The  subtractiooBv required  within  the  a^  and  b^  equations  are  the 
same  by  column  and  by  row  as  explained  previously.  Within  the  set  of  coeffi- 
cients for  the  (ab)i-j  the  subtractions  and  additions  which  may  be  con- 
Treniently  chosen  for  each  row  are  as  follows: 

^j  -  ntq  -  npj  +  npq. 

After  these  changes  have  been  made  bv  column  the  same  procedure  is  followed  by 
row  with  the  modified  coefficients  for  the  (ab)ij  equations  and  for  the  RHM's- 
Hence,  the  reduced  RHM's  for  the  remaining  (ab)ij  are 


^j  -  ^q  "  ■^pJ 


-  T-nA    +  X 


pq. 


Considerable  manipulation  of  the  coefficients  and  HHM*s  of  the  least- 
squares  equations  are  required  to  impose  these  latter  restrictions.  Howevei', 
if  desirable,  the  subtractions  and  additions  required  are  easily  programmed  on 
Mgh  speed  computers  and  when  the  number  of  equations  are  small  they  can  be 
completed  quickly  on  a  desk  calculator  or  adding  machine.  By  computing  the 
-mibtractions  and  additions  for  all  elements  that  are  to  remain  in  the  reduced 
matrix  a  check  on  the  computations  is  provided  for  the  off -diagonal  elements 
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of  the  coefficient  matrix,  since  the  reduced  coefficient  matrix  must  be 
s^nranetrical  about  the  main  diagonal. 

Completing  the  Least-Squares  Analysis 

Any  standard  procedure  may  be  used  to  compute  the  matrix  inverse  to  the 
variance-covariance  matrix  and/or  the  solution  to  the  reduced  set  of  equa- 
tions. It  is  often  desirable  to  comoute  the  solution  of  the  equations 
directly  for  at  least  one  RHM,  and  at  the  same  time  as  the  inverse  matrix  is 
being  computed,  in  order  to  check  on  the  accumulation  of  rounding  errors.  The 
difference  between  the  constant  estimates  obtained  from  direct  solution  of  the 
equations  and  those  obtained  by  multiplying  the  inverse  matrix  by  the  reduced 
RHM's  provides  a  measure  of  the  accumulation  of  rounding  errors.  It  is  de- 
sirable to  have  the  two  estimates  of  the  same  constants  agree  to  at  least  four 
significant  digits.   Checking  of  the  off -diagonal  elements  of  the  inverse, 
when  a  n^thod  of  inversion  is  used  which  computes  all  elements  of  the  inverse, 
is  an  undesirable  check  on  the  accumulation  of  rounding  errors  since  the 
rounding  errors  effect  both  off -diagonal  elements  in  much  the  same  manner. 
When  no  complete  confounding  of  effects  remains  in  the  reduced  least-squares 
matrix  and  no  constants  are  being  fitted  for  partial  regressions,  the 
accumulation  of  rounding  errors  is  seldom  a  problem  when  standard  procedures 
using  seven  or  eight  significant  digits  in  all  calculations  are  used  to  invert 
the  matrix. 

The  inverse  elements  for  the  column  and  row  for  ap  (or  bg)  may  be  ob- 
tained by  adding  the  inverse  elements  for  the  ai  columns  (or  rows)  and  re- 
versing the  sign  of  the  sum  as  explained  in  the  first  section.  The  same  type 
of  procedure  is  used  to  obtain  the  inverse  elements  for  the  interaction 
columns  and  rows  which  were  eliminated,  e.g., 

q-1 
OiAb±q   .  c^biqP.  „  _2  C^^^ij 

Caiabiq  ,  Qab^^i.  .  ^^     C^i^^ij 
^jbjabiq  ,  ^abiqbj  _  ^f  c^jabij 

i-l 

CJiabpq  .  c^bpqji  ,  fj^  cM^biq  .  ^~     (>^bpj 

i-l  j-i 

etc. 

The  sum  of  squares  for  error  in  the  analysis  of  variance  is  computed  from 
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If  yijk-«ft^*^'^j>(^^>ij]> 


where    «  ^  p-1  q-1 

i-1 . j»l 

The  total  reduction  in  sum  of  squares,  R|Vi,ai,b4,  (ab)ijj ,  niay  also  be  com- 
pruted  from  2Z  ^3  in  this  case,  since  all  of  the  variability  among  the  AB  sub- 

classes  is  accounted  for  by  the  constants  being  fitted.  Hence,  by  computing 
this  total  reduction  in  both  ways  another  check  is  provided  on  the  accumul- 
ation of  rounding  errors  during  the  solution  of  the  equations. 

The  sums  of  squares  for  A,  B,  and  AB  may  be  computed  by  the  direct  pro- 
cedure, B*Z-1b,  which  involves  the  segments  of  the  matrix  inverse  to  the  var- 
iance-covariance  matiTJc  and  the  constant  estimates  or  from  the  indirect  pro- 
cedure involving  differences  in  various  reductions  in  sums  of  squares.  If  the 
indirect  procedure  is  used  the  solution  to  other  sets  of  equations  is  required. 
For  example,  the  sum  of  squares  for  A  (using  the  indirect  method)  is 
Rlpi,ai,b-j,(ab)i-j']  -  R[ii,b4,  (ab)j^  J  .  The  R[p.,bj,  (ab)^^  Jj  is  obtained  by  deleting 
all  equations  for  the  aj_  (by  column  and  row)  from  the  reduced  set  of  least- 
squares  equations,  solving  the  remaining  equations  and  computing  the  total  re- 
duction in  the  usual  manner. 

When  the  number  of  degrees  of  freedom  for  the  interaction  effects  is 
large  the  sum  of  squares  for  interaction  can  usually  be  obtained  more  easily 
from  R[M.,ai,bj,(ab)i£j.  -  R(|i,ai,b4)  than  from  B*Z-^B,  even  though  the  com- 
plete inverse  of  the  variance-covariance  matrix  is  available.  When  the  inter- 
action constants  must  be  fitted  along  with  the  main  effects,  the  sums  of 
squares  for  the  main  effects  can  usually  be  computed  direqtly  from  B'Z-^B  more 
easily  than  from  differences  in  reductions.  However,  if  all  subclasses  are 
filled  the  sums  of  squares  for  the  main  effects  can  be  obtained  with  the 
weighted  squares  of  means  procedure  more  easily  than  with  either  of  the 
methods  considered  above. 

The  least-squares  means  for  the  classes  of  A  and  the  classes  of  B  are 
\L  *  Si±   and  p-  +  Sj,  respectively.  The  standard  errors  for  jl,  p.  +  a^,  and 

ji  +  Dj  are  computed  in  the  same  manner  as  described  in  the  previous  section 
from  the  appropriate  inverse  elements  and  the  estimate  of  Oq   from  the  error 
line  of  the  analysis  of  variance.  However,  when  the  interaction  effects  arc 
significant,  the  investigator  is  more  interested  in  the  AB  subclass  means 
rather  than  the  class  means.  Since  a  least-squares  subclass  mean  is 

Sij  -  n  +  ai  +  bj  >  (ab)ij, 
ftHe  standard  error  nay  be  computed"  fi*om  the  inverse  matrix  and  a^  as  follows  t 
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In  the  two-way  classification  when  all  subclasses  are  filled  and  the  inter- 
action constants  are  fitted,  the  standard  error  of  the  subclass  mean,  s^^j, 

reduces  to  v   1  a^  ,  The  long  formula  for  the  standard  error  of  the  two-way 

'     e 

subclass  mean  given  above  is  useful  when  all  cells  are  not  filled  or  when  all 
interaction  constants  are  not  fitted  as  well  as  when  other  constants,  such  as 
partial  regressions  or  other  main  effects,  must  also  be  fitted  at  the  same 
time. 

Mean  separation  procedures  can  be  completed  for  the  a.  and  the  b.  in 

the  same  manner  as  described  in  previous  sections.  Procedures  for  obtaining 
sums  of  squares  for  individual  degree-of -freedom-orthogonal  comparisons  among 
the  a   or  among  the  b.  are  also  explained  in  previous  sections.   The  same  pro- 
cedures may  be  used  to  partition  all  of  the  variation  among  the  a.,  the  b. 
and  that  due  to  the  ab   into  single  degree-of-f reedom-orthogonal  contrasts. 
In  this  case,  the  diagonal  inverse  matrix  involving  the  s. .  would  be  trans- 
formed with  the  appropriate  transformation  matrix.  For  example,  suppose  there 
are  two  classes  of  A  and  three  classes  of  B  and  the  following  single  degree 
of  freedom  orthogonal  contrasts  are  desired  among  the  subclass  means: 


^1 

^2 

^3 

^21 

«22 

"23 
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1 

1 

-1 

-1 

-1 
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-1 

-1 
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-1 

-1 

0 

-1 

0 

-1 
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-1 

-1 

-2 

0 

—1 

0 

-1 

A 

B 

AB 


Using  the  rules  discussed  in  a  previous  section,  the  transformation  matrix, 
K,  is 


1_ 
24 


The  transformed  matrix,  T,  is  then  computed  from 

-«nd  the  constants  for  the  orthogonal  contrasts,  c.,  from  KB  -  c  ,  where  B  is 
^  coltmui  vector  of  the  a . .  • 
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The  "within"  subclass  inean  square  is  the  appropriate  error  term  for 
testing  the  significance  of  the  interaction  regardless  of  whether  the  model  is 
fixed,  mixed  or  random.  Likevdse,  this  error  mean  square  is  the  appropriate 
error  term  for  testing  the  significance  of  both  main  effects  in  the  fixed 
model  and  for  the  random  main  effects  in  the  mixed-  model.  There  is  no  exact  test 
of  significance  for  the  fixed  main  effects  in  the  mixed  model  or  for  both  main 
effects  in  the  random  model  when  both  unequal  subclass  frequencies  and  the 
interaction  exist. 

The  expectation  of  the  mean  squares  for  the  random  model  is  as  follows: 

E(M5) 


A 

4  -^  kj^alt, 

+  k^l 

B 

a|  +  k2a|b 

+  k3og 

AB 

4  ^  kl<^lb 

Error 

oi 

Since  k^^,  k2  snd-  k|^  are  all  different  values  when  unequal  subclass  n\iinbers 
exist  (except  when  p=q=2)  even  though  all  subclasses  are  filled,  no  exact  test 
of  significance  existsfor  either  the  a^  or  bj  effects  in  the  random  model. 
However,  this  is  not  a  serious,  problem  since  the  investigator  is  primarily 
interested  in  obtaining  the  best  unbiased  estimates  of  the  variance  com- 
ponents when  all  effects,  except  p,,  are  random  rather  than  in  tests  of  sig- 
nificance. He  will  often  have  a  priori  evidence  that  the  a^  and  b-j  effects 
exist. 

The  coefficients  for  cf^  in  the  mean  square  for  AB,  k^^,  and  the  co- 
efficients for  qb  ^^^  4l>   ^3  ^^d  ^5>  "^7  ^®  computed  by  either  the  direct  or 
the  indirect  procedure  described  in  the  previous  section.  The  other  co- 
efficients, k2  and  k|^,  are  computed  by  the  indirect  procedure.  If  every  AB 
subclass  is  filled  all  of  these  coefficients,  except  k^,  can  be  computed  from 
the  weights  used  in  the  weighted  squares  of  means  procedure  as  described  in 
the  next  sub-section. 


The  expectation  of  the  mean  squares  for  the  mixed  model  with  the  a^ 
effects  random  and  the  b-?  effects  fixed  is  as  follows: 


'J 
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Error 
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k3^ 


The  variance  due  to  the  ai,  ct|,  is  now  estimated  over  a  fixed  set  of  B  effects 
and  therefore  the  interaction  variance  which  gets  into  the  sum  of  squares  for 
A  can  not  be  separated  from  the  main  effects  of  A,  Hence,  the  denominator 
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mean  square  for  testing  the  significance  of  the  A  effects  (over  this  set  of 
fixed  B  classes)  is  the  error  mean  square.  Also,  the  estimate  of  a^  is  ob- 
tained by  computing  only  k^  in  the  mixed  model. 

Since  k^  and  k2  will  differ  when  unequal  subclass  frequencies  exist  no 
exact  test  of  significance  exist  for  the  bj  effects  in  either  the  random  or 
irdxed  model.  The  approximate  test,  in  which  the  mean  square  for  B  is  divided 
by  the  mean  square  for  AB,  decreases  in  desirability  as  the  amount  of  unequal 
frequencies  increase  and  as  the  size  of  agb  increases. 

The  exoectation  of  the  mean  squares  for  the  fixed  model  is  identical  with 
that  given  above  for  the  mixed  model  except  that  k2o|t,  disapoears  from  the 
E(MS)  for  B.  Hence,  all  effects  are  tested  for  significance  by  using  the 
error  mean  square  as  the  denominator  for  F. 

With  the  fixed  model  and  missing  subclasses,  it  must  be  realized  that  all 
estimates  of  constants  and  all  tests  of  significance  resulting  from  the  least- 
squares  analysis  are  relative  to  the  population  of  subclasses  actually  in- 
cluded in  the  data.  Generalization  of  the  results  to  combinations  of  A  and  B 
which  do  not  occur  in  the  data,  of  course,  can  not  be  made.  This  would  be 
true  even  though  the  interaction  effects  for  the  subclasses  available  did  not 
exist  since  no  estimate  of  the  interaction  effects  outside  of  the  data  can  be 
obtained. 

When  data  are  analyzed  under  the  mixed  model,  the  absence  of  AB  sub- 
classes from  the  data  creates  a  very  undesirable   situation  as  far  as  es- 
timation of  variance  comoonents  is  concerned.   In  this  case,  the  variance  due 
to  the  random  effects,  say  a|,  is  estimated  over  a  fixed  set  of  effects  and 
the  absence  of  AB  subclasses  means  that  some  of  the  aj^  effects  would  not  be 
Bieasured  over  all  of  the  B  fixed  classes.  Hence,  it  is  usually  desirable  to 
eliminate  the  A  classes  which  are  not  observed  with  all  B  classes  (when  the  a^ 
are  random  and  the  bj  are  fixed)  or  eliminate  the  B  classes  which  are  not  ob- 
served in  combination  with  all  A  classes  if  the  estimation  of  a^  is  of  primary 
concern.  If  the  primary  interest  in  the  analysis  is  to  obtain  unbiased  es- 
timates of  \L   ■•■  b^  with  minimum  error  when  missing  subclasses  exist,  there  is 
no  need  to  eliminate  any  of  the  filled  subclasses  available  since  the  effects 
of  B  are  measured  over-  a  random  sample  of  ai  effects. 

Short-cut  Procedures 

When  data  are  classified  in  only  two  ways  with  all  subclasses  filled  and 
•the  interaction  must  be  considered,  the  weighted  squares  of  means  procedure 
can  be  used  to  compute  the  sums  of  squares  for  the  main  effects  which  are  ad- 
justed for  the  interaction  effects.  A  short-cut  procedure  can  also  be  used 
to  compute  the  coefficients  of  variance  components  in  the  expectation  of  the 
mean  squares  for  the  main  effects.  However,  the  weighted  squares  of  means 
procedure  does  not  provide  for  the  comoutation  of  the  sum  of  squares  for 
interaction  (AB)  or  a  short-cut  procedure  for  computing  the  coefficient  for 
^ab>  ^3   i^  ^^^   mean  square  of  AB.  The  interaction  sum  of  squares  is  usually 
computed  by  the  general  indirect  procedure  for  computing  a 
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least-squares  sura  of  squares  in  the  two-way  classi'fication,   i.e., 
S.Sqs.~AB  -  Rfp,,ai,bj,(ab)ij]-  R(|i,ai,bj) 

Yf  p-1  q-1 

-  22-r^  -  PL  T..   -     Z     ai(Ti.-Yp.)   -     2     S-.(Y.4-Y.q) 

where  fi,  the  a^  and  the  Sj  are  the  constants  which  solve  the  reduced  least- 
squares  equations  when  the  interaction  effects  are  omitted  from  the  model. 
If  k]^  is  required  it  can  usually  best  be  computed  by  the  general  indirect  pro- 
cedure for  computing  coefficients  of  variance  components,  i.e 

ki  »  i— Y  (n..  -  22  Ri%i) 

•*-       r-p-q+1       ij      ^^ 

where  r  is  the  number  of  AB  subclasses;  the  R^J  are  the  inverse  elements  of 
the  matrix  inverse  to  the  reduced-least-squares-coefficient  matrix  when  the 
interaction  constants  are  omitted  from  the  model;  the  N^j  are  elements  of  the 
"associated"  sums  matrix  computed  from  NN'  as  explained  m  the  previous 
section. 


The  weighted  squares  of  means  procedure  for  computing  estimates  of  the 
|i  +  ai  and  |i  +  bj  the  least-squares  sums  of  squares  for  A  and  B  and  co» 
efficients  of  variance  components  in  the  E(^B)  is  given  below: 
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When  all  subclasses  are  filled,  standard  errors  of  the  a^^  and  bj  may  be  com- 
•puted  directly  from  the  weighting  factors  and  the  estimate  of  the  error  var- 
iance as  follows: 


i. 


■H: 


Sb. 
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/L 


1  a2 


J    p2/2  1/nij 
The  standard  error  of  a  subclass  mean,  Sj^j,  is 

and  the  standard  error  of  the  difference  between  any  two  subclass  means,  say, 


^ij  -  ^i'J.^ 


is 


Hence,  mean  separation  procedures  can  be  applied  easily  to  the  subclass  means. 

If  mean  separation  procedures  are  to  be  applied  to  the  jl  +  a^,  or  the 
\x.  +  Sj,  ©r  if  single  degree-of-freedom-orthogonal  comparisons  are  desired, 
inverse  elements  from  the  matrix  inverse  to  the  variance-covariance  matrix 
are  required.  When  all  cells  are  filled  the  complete  or  partial  matrix  in- 
verse can  be  obtained  by  a  short-cut  procedure.  In  effect,  the  diagonal  in- 
verse matrix  for  the  subclasses,  D-^,  is  transformed  with  the  use  of  the 
appropidate  transformation  matrix,  K,  to  obtain  C-1,  the  matrix  inverse  to 
the  complete  variance-covariance  matrix,  i.e.,  C--  «  KD-^K'.  The  trans- 
formation matrix,  K^  is  the  functional  relationship  matrix  which  gives  the 
association  of  the  s^-;,  the  subclass  means,  and  (1,  the  aj_,  the  bj,  and 
(ab)ij.  For  exaarole,  if  p  ■  2  and  q  ■  3  the  transformation  matrix  is 
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where  t'le  row  coefficients  refer  to  the  ji,  slj  ,  bj,  and  (ab)^^  constants  in 
order  and  the  column  coefficients  refer  to  tne  subclass  means  arranged  in  the 
usual  order,  s^^i,  ^Vlt   ^13>  S21*  ^22>  ^^<^  ^23'  "^^^  formulAs  which  give  -^he 


relationship  of  the  subclass  means  and  the  constants  when  ZSi  -  zS-< 

?(a^)ii  -  --  .       .  J 


0  are  as  follows : 
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The  coefficients  for  the  Si-<  in  these  formulas  are  the  elements  of  the  trans- 
formation matrix  (K),   Coefficients  for  the  transformation  matrix  are  easily 
written  and  could  orobably  be  generated  by  a  high  speed  comnuter  for  almost 
any  combination  of  p  and  q.  The  numerator  coefficients  for  any  (ab).  .  row  can 

quickly  be  obtained  from  the  products  of  the  numerator  coefficients  in  the  ai 
and  bj  rows. 

Since  D"^  is  a  diagonal  matrix  the  copDutations  can  be  arranged  in  a 
systematic  manner,  as  illustrated  in  the  section  on  the  one-way  classification 
with  covariance,  and  only  those  inverse  elements  required  need  be  computed. 
After  computing  the  inverse  elements,  mean  separation  nrocedures  and  com- 
putations required  for  testing  the  significance  of  orthogonal  contrasts  are 
carried  out  as  previously  described. 
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Absorption  of  the  |j,  +  ai  Equations 

When  subclasses  are  missing  in  the  two  way  classification  with  inter- 
action or  if  the  analysis  includes  the  fitting  of  all  constants  for  the  two  way 
iilassification  with  interaction  plus  constants  for  other  effects,  such  as  main 
effects  or  martial  regressions,  absomtion  of  the  jj,  +  ai  equations  into  the 
equations  for  the  remaining  constants  may  be  desirable.  However,  since  con» 
stants  are  being  fitted  here  for  the  (ab)ij  effects  the  proportionate  reduction 
in  size  of  the  matrix  is  not  as  great  as  is  likely  to  be  the  case,  when  inter- 
action constants  are  not  being  fitted.  If  the  absorption  process  is  to  be 
utilized  the  model  should  usually  be  arranged  so  that  p  >  q. 
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The  new  coefficients  in  the  equations  for  the  bj  and  the  (ab)^^  after 
Absorption  of  the  |x  ■»-  a^  equations  are  computed  as  follows: 

C(bjbj)  -  n.j  -  I  Si 


«2 


C[b^(ab)ij]  -  C[(ab)i^(ab)ij  «  "i^  -  ^ 
C[bj(ab)ij,]  .  c[(ab)ij(ab)ij.].  -^|£^ 

cj(ab)ij(ab)i,j]  -  c[(ab)ij(ab)i.  j]  -  0 
The  new  right  hand  members  for  the  b-j  and  (ab)j^^  equations  are 

S(bj).T.j.Z^ 

S(ab)ij  .Ti3-^£ii. 

After  absorption  of  tho  u  "*■  a^  equations,  restrictions  may  be  imposed  that 

2  bj  ■  Z(ab)j^4  "  2(ab)i-j  «  0  and  the  necessary  subtractions  completed  to  re- 

J  J 

move  the  dependencies  prior  to  solution  and/or  inversion.  With  estimates  for 
the  %4   and  the  (afc)i-i  the  original  equations  for  ^  +  aj_  can  be  solved  to  ob- 
tain the  \i.   +  aj^.  With  the  segment  of  the  matrix  inverse  for  the  bj  and 
(ab)i4,  the  inverse  elements  for  the  remaining  segments  of  the  complete  matrix 
inverse  may  be  computed,  if  desired,  with  matrix  multiplication  procedures. 
The  basic  procedure  for  completing  these  calculations  was  presented  in  the 
previous  section  on  the  two-way  classification  without  interaction  and  will  be 
given  in  general  matrix  notation  in  the  next  section. 

Estimation  of  Variance  Components 

When  all  effects  in  the  model  are  random,  except  \i,   two  useful  short-cut 
methods  which  provide  \inbiased  estimates  of  the  variance  components  will  be 
considered.  Both  of  these  methods  are  due  to  Henderson  (1953)^  and  are  re- 
ferred to  by  him  as  Method  1  and  Method  3.  Both  methods  are  based  on  least- 
squares  principles.  However,  the  estimates  obtained  for  the  variance 


6/  Henderson,  C.  R.  Ibid. 
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components  by  the  two  inethods  will  differ  for  any  given  set  of  data  and  both 
sets  of  estimates  will  differ  from  those  obtained  from  the  comolete   least- 
squares  analysis  as  described  above.   The  variances  of  estimates  of  variance 
components  from  data  with  unequal  subclass  frequencies  are  unknown. 

However,  intuitively  it  seems  that  Method  1  of  Henderson  provides  the 
least  efficient  estimates  and  the  comnlete  least-squares  analysis  provides  the 
most  efficient  estimates.   Hence,  where  feasible,  the  variance  component  es- 
timates should  be  obtained  from  the  complete  least-squares  analysis.  In  many 
instances,  however,  the  number  of  equations  is  too  large  to  attempt  the  com- 
plete least-squares  analysis  when  the  interaction  must  be  considered  and  all 
cells  are  not  filled. 

Method  1  of  Henderson  requires  the  least  labor  of  all  three  methods. 
Sums  of  squares  for  the  two  main  effects,  the  interaction  and  the  within  sub- 
class are  computed  in  the  usual  manner  for  balanced  designs  except  that  ap- 
propriate divisors  are  used  for  the  unequal  numbers.   Coefficients  for  the 
variance  components  in  the  mean  squares  (or  sums  of  squares)  are  then  com- 
puted and  the  computed  mean  squares  are  set  equal  to  their  expectation.  The 
res\2lting  set  of  simultaneous  equations  is  solved  to  obtain  estimates  of  the 
variance  components.  In  the  two-way  classification  with  interaction  the  an- 
alysis is  as  follows: 


Source  of 

Variation  d.f . 

A  p-1 

B  q-1 


S.Sqs 


i  Hi    n. . 


j  "-J 


E(M5) 


^l*kU^lb^^5^*^6^l 


AB 


r-p-q+1  22j:2-2-^-2:-^ 
ij  nij   1  ni^   j  n.j 


n. 


4'-^l4h'-^24*^3'^l 


Error 


n..-r  222  yf^v  -  22  Jj. 
ijk  ^^^   ij  Hi 


The  nine  k  values  in  E(M5)  are  computed  as  follows: 

1   ,      ill    Lii  uk 


.^ 


-  2 


r-p-q+1       n..  "  ±     n^^ 


-) 
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r-p-q-t-1   n. 


(1 2 


J  n.J 


h       q-1^^  n..     n.. 


J  "-j 


2n' 


k^.-^cn.. -£:£) 


n.. 

2     *._2 


1  r.lli^.lM^ 


7   p-1  i  n^^ 


n. 

2 


k8 


P-1  i  n^^   n., 
Znf 

1  (n.. -12:) 

p-1        n.« 


Many  terms  are  common  among  these  formulas  for  the  k*s.  Actually,  there  are 
only  five  terms  involving  the  nij,  "i.*  ^•■]i   ^^^  ^'^   that  must  be  computed 
from  the  data, 

Ifethod  3  of  Henderson  is  a  least-squares  analysis  in  which  the  sums  of 
squares  and  expectations  of  the  error  and  AB  interaction  mean  squares  are 
computed  in  the  usual  manner.  The  sums  of  squares  for  the  main  effects,  A 
and  B,  are  computed  from 


S.Sqs.~A  »  R(pi,ai,bj)  -  E(|i,bj) 


J 


and 


■rather  than 


-R(|i,ai,bO  -  2-Ii 
^.Sqs«— B  -  Rdjja^jbj)  -  (\i,a.i) 

— m(|i,ai,bj)  -  I  ^ 

;S^q8. A  -'R[|i,aj^,bj,(ab)i^    -  R[vi,bj,  (ab)^^ 
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S.Sqs.— B  «  Rkt,ai,bj,(ab)i  J  _  R  ||i,ai,  (ab)i-jj 

as  is  done  when  the  complete  least-squares  analysis  (or  weighted  squares  of 
means  analysis,  when  applicable)  is  made.  The  E(KS)  in  the  analysis  of  var- 
iance can  then  be  set  up  as  follows  for  the  random  model: 


A 

B 

AB 
Error 


E(MS) 


oe  +  kia|b  + 


^2"ab 


3^b 


^i  *   ki^^b 


The  coefficient  for  a|^  in  E(MS)  for  AB  is  obtained  in  the  usual  manner 
for  a  least-squares  analvsis,  as  previously  described.  The  coefficients  for 
Ojj  and  Gq^,   k-^  and  ke,   may  be  computed  directly  from  the  inverse  of  the 
square  symmetrical  segments  of  the  matrix  inverse  to  the  variance-co- 
variance  matrix  when  the  interaction  effects  have  been  deleted  from  the  model. 
However,  ki  and  k^  can  more  easily  be  computed  by  the  indirect  method  since 
the  liist  reduction  in  sum  of  squares  used  in  computing  both  A  and  B  reduces  to 
a  "between"  sura  of  squares.  The  formulas  for  computing  kl   and  k^  therefore 
become 


-1.  (n..  - 
q-1 


i  °i. 


2n?j 

kl   -4r  (n..  -  Zi )  . 

5   p-1       ^  n.j 

The  coefficients  for  o^b  i"  E(MS)  for  A  and  B,  k2  and  kj,  are  computed  by 
the  indirect  method  as  follows s 

where  the  R  '5  are  the  inverse  elements  from  the  matrix  inverse  to  the  var- 
iance-covariance  znatrix  for  the  model,  7i-ik  "  ^^  *  *^i  "*"  ^j  '*'  ®ijk*  and  Njlj 
are  the  elements  of  the  "associated  s\ms"  matrix  computed  from  NN'. 
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Numerical  Example 

The  hypothetical  set  of  data  analyzed  in  the  three  previous  sections  to 
illustrate  coniDutational  procedures  will  now  be  analyzed  under  the  two-way 
classification  model  with  interaction.  The  general  least-squares  procedures 
that  would  be  required  with  missing  subclasses  will  first  be  given  for  these 
data  followed  by  the  weighted  squares  of  means  procedures  that  are  applicable 
when  all  cells  are  filled.  The  mathematical  model  for  this  analysis  is 


^ijk 


where 


yijk 

Si 

(sr)ij 
-«ijk 


^1  +  Si  +  Tj  +  (sr)i-j  +  eijk 

i  -  1,  2,  3 

:)  -  1,  2 

k  »  1,  2,  ...,  n^j 

the  gain  of  the  k^h  barrow  on  the  j'th  ration  by 

by  the  i"*^^  sire, 

the  overall  population  mean  with  equal  subclass 

frequencies, 

effect  of  the  i'th  sire, 

effect  of  the  jth  ration, 
interaction  effects, 
random  errors. 


General  Least-Squares  Analysis 
1)  Least-squares  equations 

The  least-squares  equations  for  these  data  are  represented  in  tab- 
ular form  below: 


\i  Si   S2  S3  r^  T2   (sr)ii  i3r)i2   (sr)2i  (s»22  (sr)3i  (sr)32 


RHM 


V^' 

18 

U 

8 

6 

8  10 

2 

2 

5 

3 

1 

5 

9U 

81 : 

U 

U 

0 

0 

2  2 

2 

2 

0 

0 

0 

0 

16 

S2' 

8 

0 

8 

0 

5  3 

0 

0 

5 

3 

0 

0 

U8 

S3: 

6 

0 

0 

6 

1  $ 

0 

0 

0 

0 

1 

5 

30 

n.- 

8 

2 

5 

1 

8  0 

2 

0 

5 

0 

1 

0 

37 

rg: 

10 

2 

3 

5 

0  10 

0 

2 

0 

3 

0 

5 

57 

(sr)ii: 

2 

2 

0 

0 

2  0 

2 

0 

0 

0 

0 

0 

11 

isT)i2' 

2 

2 

0 

0 

0  2 

0 

2 

0 

0 

0 

0 

5 

(sr)2i: 

5 

0 

5 

0 

5  0 

0 

0 

5 

0 

0 

0 

23 

(sr)22* 

3 

0 

3 

0 

0  3 

0 

0 

0 

3 

0 

0 

25 

(sr)3i: 

1 

0 

0 

1 

1  0 

0 

0 

0 

0 

1 

0 

3 

(sr)32: 

5 

0 

0 

5 

0  ^ 

0 

0 

0 

0 

0 

1 

27 
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2)  Imposing  the  restrictions  that  Zs^    =>  Zr-  =  Z(sr),  .  =  ZCs'r)^^  =  0 

i     .1  ^   i    ^^   .1    ^J 


J 


When  these  restrictions  or  constraints  are  imoosed  on  the  constant 
estimates  and  the  appropriate  subtractions  and  additions  are  made  the 
reduced  least-squares  equations  appear  as  follows  when  placed  in  tabular 
form: 


(1  %  S2  r^ 

(s^ii 

(or)2i 

RHM 

\^' 

:       18  -2     2  -2 

h 

6 

9U 

si: 

r       -2  10     6     h 

-li 

-h 

-11 

S2! 

'.         2     6  IL     6 

-h 

-2 

18 

^1' 

:       -2     h     6  18 

-2 

2 

-20 

(sr)ii: 

:         li  -L  -b  -2 

10 

6 

30 

(sr)2iJ 

6  -Ix  -2     2 

6 

lU 

22 

3)  Ifetrix  inverse  to  the  vaidance-covariance  matrix  and  constant  estimates, 


The  inverse  of  the  6x6  symmetrical  coefficient  (or  variance-covar- 
iance)  matrix  is  given  in  the  table  belov: 


{i 

St 

S2 

^1 

(sr)ii 

(sr)2i 

v-' 

.075926 

.007107 

-.031)181 

,018519 

-.018519 

-.029630 

Si: 

.159259 

-.051852 

-.018519 

.018519 

.029630 

S2! 

.120370 

-.029630 

.029630 

.0071j07 

^' 

.075926 

.007Ij07 

-.O3II18I 

(sr)ii: 

.159259 

-.051852 

(sr)2i: 

.120370 

The  inverse  elements  to  the  left  of  the  main  diagonal  are  omitted 
from  the  table  since  they  would  be  the  same  as  those  to  the  right  of  the 
main  diagonal.  Several  of  the  off -diagonal  values  in  the  matrix  inverse 
are  of  the  same  magnitude  although  in  some  cases  they  differ  in  sign. 
Also,  the  sub-matrix  of  inverse  elements  for  the  interaction  constants  is 
the  same  as  the  sub-matrix  for  the  sire  constants.  These  similarities 
occurred  primarily  because  of  onlv  two  classes  for  rations  in  this  set  of 
data  and  will  not  occur  when  p  and  q  are  each  larger  than  three. 

The  constant  estimates  obtained  by  multipl:ying  the  inverse  matrix  and 
the  RHM's  in  the  reduced  matrix  together  are  as  follows: 

PL 

82 


1.8889 

^L  = 

-.5222 

-.8889 

(sr)ii  - 

2.0222 

1.5778 

(s"r)2i  " 

-1.3lljh 

Estimates  of  the  remaining  constants  are  computed  as  follows: 
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S3  »  -(-.6889  +  1.5778)  = 
2-2  =  -(-.5222)   =   .5222 
(sr)3.2   =  -(2.0222)    =  -2.0222 
(sr)22  =  -^1.3Mil)   =  1.3ljhl 


-.6889 


(sr) 


31 


-(2.0222  -  I.3I1I1L1I')  =  -.6778 


(sr)32  =  -(-2.0222  +  1.3L1jI-i)  =  .6778 

Four  decimal  digits  are  carried  in  the  constant  estiirates  to  avoid  ser- 
ious rounding  errors  in  the  comoutation  of  STuns  of  squares.  Obviously, 
these  estimates  are  not  accurate  to  four  decimals. 

h)     Sums  of  squares  for  the  anal^/sis  of  variance. 

S.Sqs Error  =  2ZZy|^j^  -  R[p.,Sj^,r^,  (sr)^^! 

=  568  -  (l.e889)(9M  -(-.88P9)(-ll:) 

-(1.5778)(18)  -(-.5222)(-20)  -(2.0222)(30) 
-(-1.3Mili)(22) 

=  26.0652 

When  constants  are  fitted  for  all  degrees  of  freedom  among  the  subclasses 
the  error  sum  of  squares  iray  also  be  computed  from 

ijk  ^^^   ij  °1J 
With  this  example, 

S.Sqs. Error  =  568  -  511.9333  -  26.0667 

which  checks  (within  rounding  error)  with  the  sum  of  squares  obtained  for 
error  using  the  general  computing  method  above.  It  will  be  noted  that 
this  is  the  same  sum  of  squares  that  was  used  for  error  in  the  two-way 
classification  without  interaction  for  testing  the  significance  of  the 
interaction  effects. 

The  sums  of  squares  for  sires  (S),  rations  (R)  and  the  interaction 
(SR)  are  computed  by  the  general  direct  procedure  (B'Z'^B)  as  follows: 


S.Sqs. V 


159259    -.051852 
051852      .120370 


[-.8889     1.5778|  I   • 

[-.8889    1.5778]  M^^    ^-^^^^^^ 
|3.11j6096     9.662967 

[-1.5281     12.1-197]  \l\f^jl\ 
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S.Sqs. R  - 

S.Sqs. SR  - 


21.0015 

(-.5222)^ 
.075926 


3.5916 


[2.0222     -1.3lililjl      • 

flO.5393     -6.6289]  ll-°lli\ 
30.22U5 


159259     -.051852 
051852      .120370 


-1 

1       [2.0222! 
L-l.3lihlj] 


Except  for  rounding  errors,   it  will  be  noted  that  the  sura  of  squares  ob- 
tained for  interaction  in  this  manner  is  the  same  as  that  computed  in  the 
last  section  from 

y? 

2Z  -Al  -  R(|i,Si,r^)-. 


However,  it  should  also  be  noted  that  the  sums  of  squares  for  S  and  R 
above  are  considerably  different  from  those  obtained  for  these  sources  of 
variation  when  the  interaction  effects  were  disregarded.  Also,  of  course, 
the  constant  estimates  for  the  s^  and  r-*  are  different  in  the  two  an- 
alyses. Still  another  point  worth  noting  in  the  two  analyses  is  the 
difference  in  the  diagonal  inverse  elements  for  y,,  s^^,  53  >  and  rj^.  When 
adjustment  must  be  made  for  the  interaction  effects  these  inverse  elements 
are  always  larger  unless  equal  subclass  frequencies  exist.  The  only 
factor  causing  these  inverse  elements  to  be  larger  and  the  constant  es- 
timates and  the  sums  of  squares  for  S  and  R  to  differ  is  the  unequal  sub- 
class frequencies.  The  distribution,  and  not  the  magnitude  of  the  inter- 
action effects,  is  the  only  factor  involved  in  making  these  adjustments. 

When  both  the  s^  and  the  ra  are  fixed  the  analysis  of  variance  may 
be  completed  as  follows: 

Source  of  Variation  d.f,   S.Sqs.     M.S.    F 


S 

R 

SR 

Error 


2  21.0015  10.5008  li.83* 

1  3.5916  3.5916  1.65    n.s. 

2  30.22h5  15.1122  6.96* 
12  26.0652  2.1721 


^Indicates  significance  at  the  .05  level  of  probability, 

5)     Least-squares  means,    standard  errors  and  individual  comparisons. 

The  least-squares  means,  which  may  be  of  interest  in  an  analysis 
such  as  this,   are 

|i  "  h.9 The  overall  mean  expected  with  equal  numbers, 

jl  "♦■  8^  -  li.O  -  -  -  Sire  No,  1  mean. 

ii  +  82-6,5 Sire  No.  2  mean. 
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(i.  +  So  ■  U«2  -  -  -  Sire  No.  3  mean. 

(1  +  r^^  -  h»Ii  -  -  -  Ration  No,  1  mean. 

p,  +  rp  "  5.1'  -  -  -  Ration  No.  2  mean. 
p.  •'^  %  +  ri  +  (sr)]_]L  "  5.5  -  -  -  Sire  1  -  Ration  1  subclass  mean. 
\L  ■*■  82^  +  T2  +  (sr)^2  ■  2.5  -  -  -  Sire  1  -  Ration  2  subclass  mean. 
^  +  S2  +  T^  +  (^^21  "  ^*^  -  -  -  Sire  2  -  Ration  1  subclass  mean. 
(i  H  Sp  +  r2  (sr)22  "  8.3  -  -  -  Sire  2  -  Ration  2  subclass  mean. 
p.  ■«•  S3  +  r^  +  (sr)32_  «  3.0  -  -  -  Sire  3  -  Ration  1  subclass  mean, 
(1  +  'so  +  r2  ■♦■  (sr)32  »  5.l4  -  -  -  Sire  3  -  Ration  2  subclass  mean. 

The  standard  error  for  each  of  these  means  may  be  computed  from  the 
elements  of  the  inverse  matrix  and  the  estimate  of  cr|,  2.1721,  from  the 
analysis  of  variance  in  the  usual  manner  which  has  previously  been 
illustrated.   Pairvd.se  tests  of  significance  among  the  sire  means  or  among 
the  subclass  means  may  be  completed  with  the  "t"  test  or  Duncan's  Multiple 
Range  Test  with  the  information  now  available  as  have  also  previously  been 
illustrated.   The  method  of  computing  sums  of  squares  for  single  degree- 
of -freedom-orthogonal  contrasts  was  also  described  and  illustrated  in  the 
earlier  sections. 

6)  Estimation  of  variance  components. 

Although  the  number  of  degrees  of  freedom  for  sires  and  rations  are 
wholly  inadequate  to  obtain  accurate  estimates  of  variance  components  for 
these  two  main  effects  or  for  the  interaction,  the  computational  procedures 
required  to  obtain  these  estimates  will  be  illustrated  with  this  small  set 
of  data. 

If  the  effects  of  sires  and  rations  are  both  regarded  as  random.iZ/ 
the  expectations  of  the  mean  squares  in  the  analysis  of  variance  are  as 
follows : 


d.f. 
p-1 

E(M5) 

s 

^1  "  kh^lr  "^  k^aj 

R 

q-1 

<^l  *  ^2^ST  +  ^3^v 

SR 

r-p-q+l 

^l  ^  k^af^ 

Error 

n..-r 

-1 

The  coefficient  for  a|j.  in  the  SR  mean  square,  ki,   and  the  co  _ 
efficients  for  Op,  and  Og,  k^  and  k^,  may  be  computed  by  the  direct  method 


7/  The  effects  of  rations  would  probably  never  be  regarded  as  random  in  prac- 
tice. This  is  done  here  merely  to  illustrate  variance  component  estimation 
with  the  random  model. 
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from  the  Z"-*-  matrices  that  were  used  to  compute  the  sums  of  squares. 
These  computations  are  shown  below: 

7.303395  +  9.662967  -(§) (3.116096) 


13.820266  ^  2.303 
6 


,    1/. 075926  _  13.17072   ,   ^^^ 
jj.  -  -L — „  6.585 


^5 


7.303395  +  9.662967  -  (4) (3.11x6096) 


13.820266 


)4.607 


The  coefficients  for  ajj.  in  the  E(^B)  for  R  and  S  are  computed  by  the 
indirect  method  as  follows: 

k2  -  I  (18  -  2ZR^^%j) 

18  -2  2  li  6 

-2  10  6  -li  -I4 

2  6  U  -li  -2 

li  -li  -h  10  6 

6  -h  -2  6  Ih 

r0711i09     .01192li     -.021.251     -.020326     -.021952] 

.15U71j2     -.059079       .020326       .021952 

^108807       .032521     -.OOL878 

.158536      .OI18761 

.107317 

when  the  off -diagonals  on  the  left  are  omitted. 


»-l 


N 


ij 


NM»  = 


2 

2 

5 

3 

1 

5 

2 

2 

0 

0 

0 

0 

0 

0 

5 

3 

0 

0 

0 

0 

0 

0 

1 

5 

2 

0 

0 

0 

0 

0 

0 

2 

0 

0 

0 

0 

0 

0 

5 

0 

0 

0 

0 

0 

0 

3 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

5 

N' 
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68 


Si     S2  S3  (sr)^!  (st)i2   (6r)2i  (sr)22  (sr)3-,  (sr) 


31  ^-=^02 


8  3lj  26 

8   0  0 

~  3li  0 

"~  26 


h 
h 
0 
0 
li 


h 
h 
0 
0 
0 


25 
0 

25 
0 
0 
0 

25 


9 
0 

9 
0 
0 
0 
0 
9 


1 
0 
0 

1 

0 
0 
0 
0 

1 


25 
0 
0 

25 
0 
0 
0 
0 
0 

25 


when  the  off -diagonals  on  the  left  are  omitted. 

The  reduced  Nj^  matrix,  after  subtracting  by  rows  and  columns,  is 

^68  -18   8   2li   liO" 

3U  26  -21  -2h 

60  -2la   -8 

3li   26 

60 

when  the  left  half  is  again  omitted. 

Hence, 

k2  -  [18  -  (68)(.071L09)  -  (-18)(.01192L) (60)(.1073l7)| 

=  18  -  l5.80L5lli  =  2,195 

The  calculations  involved  in  computing  k}^  by  the  indirect  method  are 
as  follows: 

ki,  - 1  (18  -  m^J%j) 

18  -2  I  6" 

-2  18  -2  2 

U  -2  10  6 

6  2  6  Ih 


j-1 


.067106       .009363     -.009363     -.026217 
.0611x86       .02l8h8     -.022160 
.lUj8l9     -.06117U 
.11201:7 
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N^j    =  NN» 


2  2  5  3  15 
2     0     5     0     10 

020305 
2  0  0  0  0  0 
0  2  0  0  0  0 
005000 
000300 
0  0  0  0  10 
000005 


N» 


V-    TL     ^2   ^^^^11   (sr)i2  ,(sr)2i   (sr)22   (5^)31   (sr)32 


68  30  38 

h 

h 

25 

9 

1 

25 

30  0 

h 

0 

25 

0 

1 

0 

38 

0 

h 

0 

9 

0 

25 

k 

0 

0 

0 

0 

0 

h 

0 

0 

0 

0 

il 

0 
9 

0 
0 

1 

0 

0 

0 

25 

Hence, 


% 


68     -8       2h     ho 

68     -18      8 

31     26 

60 


I  (18  -  13.39327h)  =  |  (U. 606726) 


2.303 


The  reduced  N   matrix,  after  subtracting  by  rows  and  columns,  is 


Use  of  the  indirect  method  to  comnute  k2  and  k[^  involves  a  great  deal  of 
arithmetic  even  when  the  number  of  classes  for  both  A  and  B  is  small.  A 
shorter  and  more  direct  method  should  be  available  to  compute  these  co- 
efficients. Perhaos  one  will  be  discovered  in  the  near  future.  Until 
then,  the  use  of  the  complete  least-squares  analysis  for  estimating  var- 
iance comoonents  when  interactions  must  be  considered  and  subclasses  are 
missing  will,  no  doubt,  be  limited. 


Estimates  of  the  variance  components  may  now  be  computed  from  the 
complete  least-squares  analysis  as  follows: 


Source  of  Variation 
S 

R 

SR 

Error 


M.S. 
10.5008 

3.5916 

15.1122 

2.1721 


a| 


E(jg) 
+  2.303  a|r  ^  h.607  a| 
+  2.195  a|r  +  6.585  0} 
+  2.303  air 


-76. 


>!s«P 


!"'^^¥^^'^^'^-^' :  ■  '"-^"!^'f  ^'^MJ^l^^'^m^U^mM^^^  -  - ' 


awh'riii'i 


mismiii&UUmiiiiisA 


mvmmimf  !'ii'«minu^"'Aii>xii  vi^iu.m^m tomwunnfrnmrn 


KillliiiltKi  I I    1 '    '  i|  mill'   I  '     «vtMi:,Mia^»^' '»»"«' » t;W-~- ■>"«•«>•'— <wii>'*»t9i^«'',^gt'"' 


.<»i->r"-*-"^'''''''^'*^"'""' 


—  ~ s.     ri..^^n»&i.    nnlw       RAnrndUCtton    OJ     fhin    IJom    Mav    Cnnatlintti    r.nriMFlnht    tnfrlnn^manf 


2.172 
5.618 


*2 


-1.657 


cr|  »  -1.001 


The  variance  of  these  variance  component  estirnates,  of  coiirse,  is  ex- 
tremely large.   Many  more  degrees  of  freedom  are  needed  for  both  S  and  R 
in  order  to  obtain  reliable  estimates  of  the  variance  components.  The 
variance  component  analysis  for  these  data  is  presented  merely  to  illus- 
trate the  computational  procedures. 


Weighted  Squares  of  Means  Analysis 

When  all  degrees  of  freedom  among  a  set  of  subclasses  are  partitioned 
into  a  set  of  orthogonal  comparisons,  as  is  done  in  factorial  analyses, 
the  estimates  of  all  constants  are  determined  from  the  subclass  means.  In 
this  case,  weights  are  easily  determined  that  are  useful  for  computing 
sums  of  squares,  variances  of  constant  estimates  and  coefficients  of  var- 
iance components.  Steps  required  in  the  application  of  the  weighted 
squares  of  means  procedures  to  the  analysis  of  the  data  in  the  numerical 
example  will  now  be  given. 

(1)  Computation  of  constant  estimates. 

\i  +  sx 


11  +   83 

ji  +  ri 


Si 

«3 


q 
5.5+2.5 


h.oooo 


u.6000+8.3333 


6.U667 


3.0+5.1x 


f»ij 


U.2000 


5.5+I1.6+3.O 


U.3667 


2, 5000+8. 3333+5.1x000  ^  ^^i^n 

U. 0000+6. Ii6667+Ii. 2000  ^  L. 3667+5. hill 

3  "  2 

U.OOOO  -  It. 8889  »  -.8889 
6.1667  -  ii.8889  -  1.5778 
U.2000  -  I4.8889  -  -.6889 


1.3889 
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r^  =  h.3667  -  1.8889  -  -.5222 
rg  =  5.lail  -  U.8869  =  .5222 


(2) 

Computation  of  weights. 

u  .    ^' 

"^   2  1/ni,. 

J 

li 

w^  =  r— r  "  4.0000 
2  2 

W2  =  ^^-^  =  7.5000 

5^3 

W3  =  -^  =  3.3333 
3   1.1 

P^ 


p7^ 


V2  =  .j-^  »  8.7097 


2   3  5 


(3)  Computation  of  sums  of  squares. 

S.Sqs,  Error  »  Within  Subclass  S.Sqs,  =  26.0067 

S.Sqs. SR  =  30.22h5 

The   sura  of  squares  for  interaction  must  be  comouted  from  the  least- 
squares  analysis,   i.e.,   B'Z~^B  or  Rr[i,ai,b-j,  (ab)ij|-  R([i),ai,b-j) , 

S.Sqs R  -  (li. 0000) (-.8889)^  +  (7.5000)  (1.5778)^  +(3.3333)(-.6889)^ 

-  B^«0000)(-»8fi89)  +(7.5000)(1.5778)  -^(3.3333)(-.6889}]  ^ 
It. 0000  +  7.5000  +  3.3333 

=  21,0013 

S.Sqs. S  -  (5.29la)(-.5222)^   +(8.7097)  ( .5222f 

[(5.29Ll)(-.5222)   -»•( 8. 7097) (.5222)1  ^ 
5.2911  +  8.7097 

«  3.5916 

It  will  be  noted  that  these  sums  of  squares  for  R  and  S  agree  within 
rounding  errors  with  those  obtained  by  the  general  least-squares 
procedure,  B'Z"*^B. 

(li)  Computation  of  k's. 

As  pointed  out  previously,  k^  must  be  computed  by  the  general 
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I 
I 


least-squares  procedure  even  though  all  subclasses  are  filled.     All 
other  k's  may  be  computed  from  the  weights  as  follows: 

1        r^t^  90M^R  7no7>       (5.29Ul)^-^(8. 7097)^1 
^2  -  j^j^  j^(5.29hl*8.7097)  -     ^.2911^8.7097 J 

.  i  (1,.0038  .  123^) 

„  1  (6.5851)  -  2.195 
3 

k3  -i  (6.585Ii)  -  6.585 

V,    -       ^        l-lh  8-^1^       (U.OOOO)^  -^(7.5000)^  ^(3.3333)^1 
^i*      T2K27  P-^^^^ Ii:8533  J 

.  i  (9.2135)  =  2.303 

k^  -  I  (9.2135)  »  I1.607 

It  will  be  noted  that  these  k  values   (k2,ko,ki  ,  and  k^)coiaputed  in  this 

manner  agree  with  those  obtained  by  the  general  least- square 3  pro- 
cedure.    When  all  subclasses  are  filled 


and 


kg    «  -  kri 

^      p    ;> 


Short-cut  Procedure  of  Coinputing  the  Ifetrix  Inverse 

When  all  degrees  of  freedom  among  a  set  of  subclasses  is  par- 
titioned into  a  set  of  orthogonal  comparisons,  the  complete  matrix 
inverse  to  the  variance-covariance  matrix  can  be  computed  from 

where  K  is  the  transformation  matrix  and  D~l  is  the  diagonal  inverse 
matrix  for  the  set  of  subclasses.  In  the  present  problem  these  com- 
putations are  simplified  with  the  aid  of  the  following  table: 
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i^  K*     l~  (.I.  .  . 


Constant 

Estimates 


Tl     sJ~iJTJTsF]3JTsr)^ 


Me&n 


D-1 


^11 

1 

2  -1 

1 

2 

S12 

1 

2  -1 

-1 

-2 

S22     o 

1 

1 

-1     2 
-1     2 

-1 

-1 
1 

S31 

1 

-1  -1 

1 

-1 

S32 

1 

-1  -1 

-1 

1 

5,5000 

.500000 

11.3889 

2.5000 

.500000 

-.8689 

L,6000 

,200000 

1.5778 

8.3333 

.333333 

-.5222 

3.0000 

1,000000 

2.0222 

5.b000 

.200000 

-1.3liljJa 

The  transr>ose  of  the  transformation  niatrix   (K^)  ap-oears  on  the  left 
in  the  table.     The  values  in  the  D~-  colujrm  are  the  reciorocals  of 
the  sijbclass  numbers.      The  constant  estirnates  are  comouted   from 

^  =  ^  (5.5000  +  2.5000     +  +  +  5.1000)   =  1.8889 
h.^Z  [(2)(5.5000)+  (2)(2.5000)-  (l)(li.6000  —  (i)(5.looo)J 

=.  -.8889 

etc. 

The  inverse  elements  of  the  matrix:  inverse  to  the  variance-covarianc$ 
matrix  nay  be  conrouted  one  at  a  tiire  as  follows: 

CW-  "  2.  j(l)^(. 500000)   +(l)^(. 500000)   +  +  +  (1) ^(,200000)1 

=  ,075926 
(?AS1  =2-  [(1) (2) (.500000)  +(1) (2) (.500000)  +  +  +  (!)(-!) (.200000)] 

-  .OOTliOy 
CP^2  =^  [(1)(-1)(. 500000)   +(1)(-1)(.500C00)   +  +  +  (1)(-1)(. 200000)] 

=  --0311181 


^sraisrii  ^    1    |(.1)2(. 500000)   +(1)  =  (.500000)  +  ^  ^  (l)^(.2OOO00)] 


,020370 


Jcpplication  of  Method  1  of  Henderson 


Tfftien  this  method  is  nsed  to  obtain  estiirates  of  the  variance  comoonents, 
iJie  sums  of  squares  for  the  analvsis  of  variance  are  computed  as  follows: 
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S,5q6,_S  .  ii^  -  iM2f  .  1302!  ^  i2kl 
^'  h  8  6  18 


^02  -  J49D«SS69  -  11.1111 

2 


S  SOS  R^iiaf+ilZi!        (9)0 

-  Ii96.0250  -  li9C.8889  -  5-1361 

(11)^       (5)^  (27)2 

S-Sqs. 5R     «     -^^    -*•  -—-  -f  -f  +  -^—^    -  502  -  496.0250  +  490.8889 

*  5lil*$333  -  507.1361  -  3ii.7972 

S,Sg5 Error    *  568  -  51il.9333  »  26.0667 

The  analysis  of  variance  is  as  follows: 

Sotirce  of  Variation     d-f-       S.Sqs^         H.S.  E(MS) 

S 
R 


2       11.1111       5.5556     a|  +  kyO^j.  +  kQo}  +  k^of 

1  5.1361       5.1361     a|  +  k^a|j.  +  k^c^  +  k6o| 

2  3I1.7972     17.3986     a|  +  k^^dp   +  kgCjr^  +  k^c^^ 

.2 
e 


Error  12      26,0667      2,1722     o^ 


The  terms  required  for  the  coinputation  of  ihe  k  values  for  E(MS)  are  computed 
as  follows: 

i    J-J  ^o\2Wo\2 

2 


X-'      inlll^  .  (£llli2]Z  ^  mltiS}!  ^  10.SQ33 


i 


H. 


l^i 


8 


^     "'J  0  10 


1^.01^82^6)^ 
18 


S.hhhh 


2n 


n«. 

2 


n. 


^  ,  (8)^-^(10)^ 
18 


9.1111 


1^ 


(2)"^(2)^-K(5)M3)^^(l)^^(5)^  .  3  7778 
18 
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The  nine  k*s  are  now  computed  as  follows: 

ki  -  I  (18  -  10.5833  -  7.5500  +  3.7778)  =  1.8222 

k2  «  I  (9.1111  -  10.5833)  -  -.7361 

w  =  1  (6.ll:l4li  -  7.5500)  =  -.5528 
^       2 

%=l   (7.5500  -  3.7778)  =  3.7722 
k^  »  i  (18  -  9.1111)=  8.8889 
k5  =  i  (7.5500-  6.[i)aUi)  =  1.1056 
ky  =  I  (10,5833  -  3.7778)  =  3.U028 
k8  =  I  (10.5833  -  9,1111)  =  .7361 
k^  -  i  (18  -  6.II1II1)  =  5.7778 

When  the  computed  mean  squares  are  set  equal  to  the  extjected  mean  squares  and 
the  resulting  equations  are  solved,  the  following  estimates  for  %he   variance 
components  are  obtained : 

a|  =  2.172    ^  =  -2.087 


<7sr 


=  6.593    al   -  -3-031 


The  estimates  of  o^^'t  ^  ^^^   ^s  ^°  ^^'^   agree  with  those  obtained  from  the  com- 
plete least-squares  analvsis.  Although  these  estimates  probably  have  larger 
sampling  errors  than  those  obtained  with  the  complete  least-squares  analvsis 
they  are  unbiased  provided  all  effects  in  the  model,  except  |i,  can  be  re- 
garded as  random. 

Application  of  Method  3  of  Henderson 

With  this  method  the  sums  of  squares  for  error  and  the  interaction  re- 
main the  same  as  with  the  complete  least-squares  analvsis.  Also  the  co- 
efficient for  a|p  in  the  expectation  of  the  interaction  mean  square,  ki,  re- 
mains the  same.  The  sums  of  squares  for  S  and  R  are  computed  as  follows 
nnder  this  method: 

^•Sqs. S  «  R(p.,Si,rj)  -  R(p.,rj) 

-  511.7036  -  I196.O250  -  15.6786 
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mtmni^^ 


S.Sqs. R  »  R(fi,Sj,,rj)  -  R(|i,Si) 

'  -  511.7036  -  502.0000  =•  9.7036 


It  should  be  noted  that  these  are  the  same  sums  of  squares  that  were  computed 
for  S  and  R  when  the  interaction  was  assumed  to  be  non-existent.  The  co- 
efficients for  a|  and  Op  in  the  expectation  of  the  mean  squares  for  S  and  R 
(k^  and  ki)  resoectivelv,  also  remain  the  same  as  when  interaction  was  dis- 
regarded. The  coefficients  for  o^j.  in  the  expectations  of  the  mean  squares 
for  S  and  R,  k.'  and  k'  are  computed  as  follows: 


k' 


1    IJ 


Zn 


k'    =  i(ZZR^JNi^  ^  Z  -^ 


ij 


Z  -^ ) 


1(ZZR^^N.  .  -  Z 
2  ij         ^^       j 


& 


n. 


R 


18-2       2-2 

10       6       li 

Ih       6 

18 


>-l 


,06llje6     .0218L8 
.ILL819 


,022160 
.061171: 
,1120L7 


.009363 
-.009363 
-.026217 

.067lil6 


N 


ij 


NN» 


2  2  5 
2  2  0 
0  0  5 
0  0  0 
205 
0     2 


1 

5 

0 

0 

0 

0 

1 

5 

1 

0 

0 

5j 

N' 


030 


68     8     31     26     30     38 
8      0      0      b       It 

31       0     25 
26       1 

30 


9 
25 

0 
38 


"Mien  the  appropriate  subtractions  and  additions  are  made  by  column  and  row  N 
reduces  to: 
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68     -18       8     -8 

3U    26     2li 

60     ho 

68 


Hence, 


k^  «  13.3933  -  10.5833  =  2.S10 
H  "  I  ^^^'^^^^  "  7.5500)   =  2.922 


The  analysis  of  variance  ^dlth   the  expectations  of  the  mean   squares  can  now  be 
set  up  as  follows: 


Source  of  Variation     d.f.        S.Scs 


M.S. 


E(MS) 


s 

2 

15.6786     7.8393 

a|  +  2.922  a|r  +  5.225  a| 

R 

1 

9.7036     9.7036 

aj  +  2,810  a§r  4-  7.U7  0^ 

SR 

2 

30.2297  15.III18 

cl  +  2.303  a|j. 

Error 

12 

26.0667     2.1722 

a| 

Estimates  of  the  variance  corrroonents  when  corrrouted  bv  Method  3  of  Henderson 
Sltq  as  follows: 


aj  =  2.172 
afr  =  5.620 


aj  =  -l.llli 


ol   =  -2.058 


Three  methods  of  comouting  unbiased  estirrates  of  variance  components 
under  the  random  model  have  been  illustrated  with  this  small  numerical  ex- 
ample. Presumably,  estimates  of  variance  components  obtained  from  a  comolete 
least-squares  analysis  have  smaller  samr)ling  errors  than  those  obtained  with 
either  Methods  1  or  3  of  Henderson.  However,  short-cut  procedures  are  still 
needed  before  either  the  complete  least-squares  analysis  or  Jfethod  3  of  Hen- 
derson will  be  feasible  for  manv  sets  of  data. 
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MULTIPLE  Aro  NESTED  CUSSIFICATIONS 

Most  sets  of  animal  breeding  data  with  disproportionate  subclass  numbers 
are  classified  in  more  than  two-ways.  Often,  this  is  also  true  with  other 
types  of  data  where  unequal  subclass  numbers  exist.  Many  of  the  computational 
procedures  presented  in  the  previous  sections  are  directly  applicable  to  any 
least-squares  analysis.  On  the  other  hand,  special  techniques  are  often  re- 
quired to  complete  least-squares  analyses  which  involve  multiple  classifi- 
cations and  the  combination  of  multiple  and  nested  classifications.  In  this 
section  will  be  presented  some  cf  these  special  techniques  and  short-cuts  and 
a  computational  example  which  includes  multiple  classifications,  a  nested 
classification,  interaction,  and  two  partial  regressions  in  one  analysis.  In 
addition,  the  general  least-squares  analysis  is  presented  in  matrix  notation. 

Models 

Although  there  are  many  specific  designs  for  vrhich  the  linear  mathematical 
models  are  standard,  the  most  appropriate  for  use  in  the  analysis  of  any  given 
set  of  data  may  differ  considerably  from  the  standard  models.  Since  the 
accurateness  and  validity  of  conclusions  drawn  from  any  analysis  arc  highly  de- 
pendent upon  how  accurately  the  selected  mathematical  model  describes  the 
biology  involved,  the  importance  of  selecting  the  appropriate  model  can  not  be 
ovBreitiphasized,  On  the  othar  hand,  it  is  doubtful  that  nil,  assumptions 
necessary  for  tests  of  significance  ana  estimation  procedures  to  be  strictly 
valid,  from  a  mathematical  viev/point,  will  ever  be  entirely  satisfied.  Never- 
theless, every  effort  should  be  made  to  select  the  model  which  does  allow  the 
assumptions  required,  with  respect  to  homogeneity  of  variance  and  independence 
of  errors,  to  be  most  near!]^  satisfied.  At  the  same  tim.e,  however,  the  model 
should  not  be  so  complex  that  the  analysis  can  not  be  completed  at  a  reasonable 
cost. 

When  unequal  subclass  numbers  exist  and  constants  are  fitted  for  a  set  of 
effects  (either  main  effects,  interaction  effects  or  regression)  which  really 
has  no  influence  on  the  variability  in  y,  the  constants  for  other  sets  of 
effects  that  are  estimated  at  the  same  time  and  the  tests  of  significance  for 
such  effects,  although  unbiased,  may  be  very  inefficient.  Hence,  in  the 
selection  of  the  model  for  the  analysis  it  is  important  to  exclude  from  the 
model  sets  of  effects  w?n.ch  are  known  not  to  effect  the  variability.  At  the 
same  time,  however,  it  is  important  to  include  in  the  m.odel  .qlT  effects  which 
really  do  effect  the  variability  of  y  in  order  that  the  estimates  obtained  in 
the  analysis  will  be  unbiased.  This  places  the  investigator  in  somewhat  of  a 
quandary.  A  good  rule  to  follow  is  to  place  a  set  of  effects  in  the  model  when 
some  doubt  exists  concerning  whether  such  effects  are  really  zero  or  not. 
Methods  are  available  (and  will  be  presented  in  this  section)  for  deleting  a 
set  of  effects  after  it  has  been  found  that  the  constants  for  the  effects  with- 
in the  set  probably  equal  zero. 

The  specific  model  for  consideration  at  present  is 
yijkl  =  a  +  ai  +  bij  +  ck  +  (ac)ik  +  dDijkl  +  gGijkl  +  eijkl (5) 
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i  -  1,   2,    ..,,  p 

1 

j  -  1,   2,    ...,   q^ 

s 

k  "  1,   2,    ...,  r 

t 

1,  2f    •••s   ^ijk 

Mo,  of  AB  subclasses 

No,  of  AC  subclasses 


where: 


yijkl 


^k 
d 

l>i;5kl 
g 

^ijkl 
eijkl 


,th 


k^h  C 


class  and  in  the  j^"  B  class 


th 


the  1   observation  in  the 
within  the  i'th  ±   class, 
the  population  mean  when  equal  frequencies  exist  in  all  sub- 
classes and  Diiki  =  ^ijkl  ''  ^* 
effect  of  the  ±'^^  A  class. 

effect  of  the  ^^^  B  class  within  the  i'''^  A  class, 
effect  of  the  k'th  c  class. 


partial  regression  of  the  yiikl  ^^^  "^^Q  ^iikl* 
an  independent  continuous  variable, 
partial  regression  of  the  yiikl  o^i  the  Gi-ikl» 
a  second  independent  continuous  variable, 

randora  errors  which  ar3  assumed  to  be  lIID(0,o|).  The 
normality  assumption  is  required  to  validate  tests  of  signi- 
ficance. 
The  population  mean  when  equal  frequencies  exist  is 

p.=  a  +  dD  +  gG 
where  D  and  G  are  the  means  of  the  Dijkl  ^^*^  ^^^   ^ijkl*  However,  the  popu- 
lation mean  for  any  combination  of  values  for  D  and  G  can  be  computed  if  a,  d, 
and  g  are  known. 

The  model  ($)   does  not  include  a  term  for  the  interaction  of  the  h±^   with 
the  c^^  (tc)ijk:>  which  could  exist.  It  is  asstnned  here  that  a  priori  infor- 
mation clearly  indicates  that  the  (bc)j_j}c  effects  equal  zero.  However,  in  the 
completion  of  the  least-squares  analysis  under  model  (5)  a  procedure  will  be 
given  for  testing  the  significance  of  the  (bc)i-jk  interaction  effects.  If  the 
a  priori  information  was  incorrect  and  the  interaction  (BC:A)  is  significant 
the  estimates  of  constants  for  the  other  effects  are  biased. 

The  procedures  to  be  followed  in  the  analysis  of  a  set  of  data  under  model 
(5)  depend  on  several  factors.  If  all  effects  in  the  model,  except  the  randora 
errors,  are  regarded  as  fixed  a  complete  least-squares  analysis  is  preferred. 
In  this  case,  the  inverse  of  the  complete  variance-covariance  matrix  would  be 
req\iired  if  individual  comparisons  are  to  be  made  among  least-squai^s  means. 
Also,  if  all  effects  are  fixed  the  number  of  independent  least-squares  equat- 
ions is  not  likely  to  be  large  and  therefore,  the  complete  analysis  is  not  too 
difficult  to  accomplish.  However,  the  effects  for  a  nested  classification, 
such  as  bj_j,  usually  represent  a  random  sample  of  those  effects  and  not  a 

selected  group.  In  this  case,  the  number  of  equations  is  likely  to  be  large 
and  direct  inversion  of  all  independent  equations  may  be  impractical.  The  mere 
fact  that  a  large  number  of  original  least-squares  equations  exist  does  not 
prohibit  a  least-squares  analysis  from  being  completed.  Short-cut  methods  are 
available  which  allow  the  investigator  to  obtain  unbiased  estimates,  and  in 
many  cases  the  most  efficient  estimates,  of  gll  constants  and  most  tests  of 
significance  desired  even  though  the  number  of  equations  is  large, 
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Oftentimes  when  model  (5)  is  appropriate  both  p  and  r  are   small  but  the  qj_ 

are  large,  ^i/hen  this  is  true  tlie  equations  for  a  +  a^  +  b^-j  should  fjjrst  be 

absorbed  into  the  equations  for  c-^y    (ac)j^j^,  d,  and  g  even  though  the  inverse  of 

the  complete  variance-covariance  matrix  is  desired.  In  effect,  the  inverse  of 
the  complete  coefficient  matrix  is  then  obtained  more  easily  by  a  matrix- 
partitioning  procedure  than  if  the  complete  matidx  inverse  is  obtained  by  di- 
rect inversion.  In  order  to  present  these  procedures  (as  well  as  othei^)  in 
-  general  terms  which  apply  to  any  least-squares  analysis,  model  (5)  will  be 
given  in  vector  notation  and  then  associated  with  general  matrix  notation. 
Model  (5)  in  vector  notation  may  be  written  as  follows: 

y  =  a+A  +  B  +  C  +  AC  +  dD+gG  + 


where 


/dDiin 
'^1112 


Wpqmpqr y         \^'    Vpl    \pqj   ^r/    \^^^^pry   \     pqmpqr 


'pqmpqW      ^"pqmpqj.^ 

A  general  form  for  any  model  is         "^ 

y  =  XB^^  +  ZB2  +  e,. (6), 

If  p  =  2,  q^  "  3  for  all  i,  r  =  2  and  Hj^^j^  =  1  in  all  cases  for  an  analysis 
under  model  (5) 


X 


and 


0     0  0  (\ 

0     0  0  0\ 

0     0  0  0 

0     0  0  0 

10  0  0 

0  0  0 


1 
1 
0 
0 
0 
0 


0  0 

0  0 

1  0 
1  0 
0  1 
0  1 


h. 


^a  +  ai  +  bii> 

a  +  a-L  +  bj^g 

a  +  a-i  +  bn  3 

a  +  a2  ■*■  bp-i 

a  +  a2  *•■  b22 

a  +■  a2  +  bpo 
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when  the  constants  for  a,   a^,  and  b^j  are  combined  into  a  subclass  constant. 
This  would  be  necessary  in  order  to  absorb  all  equations  for  the  a+ai+b^j.  Als<^ 

^111  Qni\ 


B2 


Constants  to  be  fitted  have  been  arbitrarily  divided  into  two  sets.  The  B-^  set 

of  constants  includes  only  those  constants  that  can  be  conveniently  absorbed,  [i 
or  a  and  all  other  constants  which  represent  the  class  or  subclasses  effects. 
The  X  matrix  is  therefore  composed  of  only  zeros  and  ones  giving  the  c].ass  or 
subclass  within  which  each  of  the  observations  fall.  The  B2  set  of  constants 
includes  aTl  constants  to  be  fitted  other  than  those  to  be  absorbed.  The  need 
for  grouping  all  constants  to  be  fitted  into  these  two  sets  will  become  clear 
as  the  absorption  process  and  completion  of  the  least-squares  analysis  is  pre- 
sented in  general  matrix  notation. 


Least-Squares  Equations 
The  least-squares  equations  for  model  {$)   are  given  in  tabular  form  below: 
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a 

h^ 

Ck 

(ac)ik 

d 

g 

RHM 

a:           n««« 

^i.. 

Dij. 

n..k 

ni.k 

D...    ^ 

(j»*» 

Y... 

ai: 

0 

"ij. 

"i.k 

ni.k 

Di- 

Gi.. 

Yi.. 

^±V 

0 

"i;}k 

^ijk 

Din 

Gij. 

^ij. 

Ck- 

0°'*k 

^•k 

I5-k 

^••k 

Y..k 

(ac)ik: 

Q^±.)P 

Di.k 

Gi.k 

Yi.k 

d: 

ZZZZDiikl 
ijkl 

ZZZZDijklGi 
ijkl 

-Jkl 

ZZZZDijkiyijkl 
ijkl 

g: 

7.7.Z2Gi3icl 
ijkl 

ZZZZC^-  jkl^i  jkl 
ijkl 

Elements  to  the  left  of  the  main  diagonal  in  the  coefficient  matrix  have  been 
omitted  since  they  would  be  the  same  as  those  given  on  the  right  of  the  main 
diagonal. 

Before  direct  inversion  of  the  least-squares  coefficient  matrix  can  be 
made  or  a  iinique  solution  of  the  equations  can  be  obtained  it  is  necessary  to 
impose  certain  restrictions.  An  appropriate  set  of  restrictions  is  that  Z  aj_  ■ 

Z  bj_4  »  Z  cic  "  Z(ac)i3^  =  Z(ac)iic  "  0.  The  subtractions  and  additions  required 
0      k     i        ic 

among  the  coefficients  and  RHM's  in  the  least-squares  equations  when  these  re- 
strictions aj^  imposed  have  previously  been  described  except  fgr  the  restric- 
tion that  Z  bij  =  0.  When  this  restriction  is  imposed  on  the  b±j   the 

coefficients  "for  each  of  the  b;j^q.  are  subtracted  from  the  other  coefficients 

for  bi -;,  but  only  within  the  i^  A  class  of  effects,  i.e.,  the  coefficients  for 
the  last  bij  in  each  A  class  are  subtracted  frcan  the  coefficients  for  the  other 
bij  within  the  corresponding  A  class*  After  this  subtraction  by  columns  the 

same  type  of  subtraction  is  completed  by  rows,  in  which  case  the  RHM's  are  also 
involved. 

If  the  total  number  of  equations  for  a,  a^^  and  b^j  is  large  the  inverse  of 
.  the  complete  variance-covariance  matrix  and  the  estimates  of  a,  aj^  and  bj_j  can 

often  be  obtained  most  easily  by  an  indirect  but  general  method.  The  least- 
squares  equations  for  the  general  model-  (6)  in  matrix  notation  are 

[b  NJ  [bJ   [y-[ 
N' 
where,  for  the  present  example,  D  is  a  diagonal  matrix  of  the  uaa     of  order 

A        ^^  J^  A    ^ 

3  X  sj  N  is  the  matrix  of  coefficients  for  Cj^,  (ac)-y^,  d  and  g  in  the  a+a^L+bij 
equations  of  order  s  X  (r+t+2)5  N»  is  the  trajjspose^  of  N  (rows  and  coltomns  re- 
versed); S  is  the  matrix  of  coefficients  for  cj^,  (ac);j^j^,  d  and  g  in  the  cj^, 
(ac)^,  d  and  g  equations  of  order  (r+t+2)  X  (r+t+2);  I,  is  the  matrix  of  RHM's 
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for  the  a  +  a^  +  b^-j  equations,  the  Tij»j  of  order  s  x  u  with  u  being  the  num- 
ber of  dependent  variables  5  12  ^^   "^^  matrix  of  RHM's  for  the  other  eqtiations 
of  order  (r+t+2)  X  u. 

Computation  of  the  Matirbc  Inverse  and  Estimates  of  Constants 

Programs  are  generally  available  on  high  speed  electronic  computers  for 
inverting  matrices  directly  when  there  are  no  more  than  30  or  UO  equations.  A 
few  special  programs  are  available,  for  some  computers,  which  will  invert 
matrices  of  considerably  larger  size.  However,  the  cost  of  computing  the  in-  - 
verse  matrix  directly  when  there  are  more  than  about  i|.0  equations  is  usually'- very 
expensive.  With  large  least-squares  matrices  it  is  often  best  to  compute  the 
inverse  matrix  by  an  indirect  procedure2/  Suppose  the  matrix  inverse  of  the 
variance-covariance  matrix  under  model  (6)  is  partitioned  as  follows: 

[b  iri   JA  G 
[n'  s]  "  ^»  c|  • 

With  this  partitioning  of  the  inverse  it  can  be  shown  that 

c  -  (s  -  n»d-1n)-i 

G  -  -  D"1nC 

A  -  rrl  (I  -  NG» ) 
where  I  is  the  identity  or  unit  matrix.  It  must  be  realized,  of  course,  that 
restrictions  must  be  imposed  on  the  S  -  N'D~1n  equations  before  the  inverse  C 
can  be  obtained.  The  changes  made  in  the  S  -  N'B~1n  equations  as  a  result  of 
restrictions  imposed,  must  also  be  made  in  N  or  the  additional  rows  and  col- 
umns must  be  added  to  C  before  computing  G  and  A. 

Although  the  indirect  procedure  described  above  for  computing  an  inverse 
is  general  and  can  be  applied  to  any  set  of  symmetrical  equations  it  is  usually 
most  useful  for  least-squares  analyses  when  D  is  a  diagonal  matrix;.  V/hen  this 
is  true,  the  procedure  for  computing  the  "new"  coefficients  in  the  remaining 
equations  after  the  equations  for  a  set  of  classes  or  subclasses  are  absorbed 
is  given  by  S  -  N'D'^N.  Vfi.th  D  as  a  diagonal  matrix  the  calculation  of 
5  -  N'D"^  can  be  completed  most  readily  with  the  aid  of  simplified  formulas. 
For  example,  under  model  (5)  the  absorption  of  the  equations  for  a  +  a^  +  b^j 

is  accomplished  by  computing  the  following  "new"  coefficients  for  cj^,  (ac)jjt^^, 
d  and  g:  „8 

C(ckCk)  »  n..i.  -  22  — ii- 

nt         ,N     ,^  °iJk^Jk' 
C(ckCk')  "  -  22 


-3   ^i:). 

2 

C^k(ac)i^-  C  ^ac)jj,(ac)i^-  ni.k  "  2  ~T^ 


8/  This  indirect  procedure  was  first  given  to  the  author  by  Dr.  C.  R, 
Henderson  of  Cornell  University. 
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^     n. 


J    ij  • 

n.  .,  G .  . 
C(c  g)  "G..   -  ^-^f^-^ 

C[(ac)^j^(ac)^,J  -  C[Uc)^j^(ac)^,^,]  -  0 

n ,  ,  D 

J    ij  • 
ijkl  i-5^1  ^J^^ij. 

C(d  g)  -  EZZZ  D    G     -  ^  ^l'    ^^ 

G^ 

c(g  g)  -  imz  Gf ., .  -  in:  -^-^ 

ijkl  ijkl   ij  n^j^ 

Under  the  general  model  (6)  the  "new"  right  hand  toerabers  of  the  equations 
for  the  B-  set  of  constants  are  obtained  from  Y„-N'D~^Y- ;  whereas,  under  the 

specific  model  (5)  the  "new"  RHM's  for  the  c.^,  (ac).,  ,  d  and  g  equations  are 

°i1  ^ii 

ijkl  ijkl  ijkl   ij   n^^^ 

G   Y 
ijkl   ijkl  ijkl   ij   n^^^ 

It  should  be  noted  that  nij ,  appears  in  the  denominator  in  each  of  the 
N'D~%  and  N'O'^Yi  terms  given  above  for  model  (5)  and  that  a  product  or  square 
appears  in  the  numerator  in  each  case.  This  simplifies  the  computations 
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Tfftien  D  is  a  diagonal  matrix  the  matrix  multiplications  required  to  com- 
pute G  and  A  are  greatly  simplified.  Hence,  the  matrix  inverse  to  the  com- 
plete variance-covariance  matrix  can  often  be  obtained  with  a  minimum  of  time 
and  labor  even  with  a  large  number  of  least-squares  equations. 

Estimates  of  the  constants  included  in  the  Bp  set  under  model  (6)  may  be 
obtained  from      ^ 

B2  =•  (S  -  N»D""^N)"^  (^2  -  N'D-iY^). 

If  G  has  been  computed  these  estimates  may  also  be  obtained  from  Y'G  +  Y^G  or 

G*Y^  +  CY2«  When  the  estimates  of  constants  included  in  B2  are  computed  in 

both  ways  a  check  is  provided  on  the  computation  of  G, 

Estimates  of  the  constants  included  in  the  B-.  set  may  be  obtained  by  solv- 
ing the  equations  for  the  B-i  constants  after  estimates  of  the  B2  constants  have 
been  computed,  i.e.,  ^ 

B^  =  D-1  (Jj.  "  ^2^* 
If  the  G  and  A  segments  of  the  inverse  have  been  computed  these  constants  may 
also  be  obtained  from  Y'A  +  Y»G'  or  AY,  +  GY  .  These  different  methods  of  com- 

puting  Bt  provide  a  means  of  checking  the  calculations  involved  in  computing  G 
and  A« 

In  order  to  simplify  the  absorption  process  and  the  computation  of  A  and 
G  it  is  necessary,  under  model  (5)>  to  place  the  constants  a,  aj^  and  b^  4  to- 

gether.  Hence,  for  this  model,  B;^  consists  of  the  least-squares  subclass 

means,  the  cc  +  aj_  +  bj_j^  To  obtain  a  separation  of  these  constant  estimates 

one  can  iu^ose  the  restrictions  that  2aj_"2b^^'"0  and  compute  averages  of 


.•A   y\ 


the  a  +  aj_  +  bj^j,  i.e.,        2(a+aj^+bij) 

^-^-'^-qi 


22(a+ai+bi^)      2(a  +  a^) 
*   ij  i 


P 

The  estimates  for  each  of  the  aj_  and  bj  can  then  be  computed, 


2qi 

i 


A   /«     A 


With  model  (5)  the  inverse  matrix  segments  A  anri  G  apply  to  the  a+a-;+bi-i 
■<*      ^      <*  -^   J 

rather  than  to  a,  the  a^  and  b^^-j  separately.  In  msury  cases   this  may  be  all 

that  is  required.  However,  the  transformation  of  these  invgrse  segments  to 
obtain  the  inverse  elements  which  apply  dii-ectly  to  a,  the  aj^  and  b^j  is  easily 

accomplished.  The  general  method  of  transforming  sub- inverse  matrices,  such  as 
A  and  G,  was  presented  in  previous  sections  and  will  not  be  repeated  here  for 
this  special  case. 

-92- 


raaiiii'iii .  •'__ _ 

« r. -^,,h   oi.rr»nr-*«  Onlv     RaoroductJon  01  ihis  itotn  Way  Constituto  CoDvrJcht  tnfrlnacnient 


Computation  of  Sums  of  Squares 

When  constants  are  fitted  for  all  degrees  of  freedom  among  a  set  of  sub- 
classes (two  factor,  three  factor,  etc.)  except  the  highest  order  Interaction, 
the  least-squares  sum  of  squares  for  that  interaction  can  be  computed  by  a  dif- 
ference, as  was  shown  for  the  two-way  classification.   Hazel  (1946)9./  gave  a 
method  of  computing  the  least-squares  sum  of  squares  for  a  two-factor  inter- 
action when  constants  were  fitted  for  two  main  effects  and  a  partial  regres- 
sion. The  method  of  Hazel  requires  that  the  expected  subclass  means  be  com- 
puted from  the  constant  estimates  obtained  when  the  highest  order  interaction 
is  assumed  to  be  non-existent.   The  sum  of  squares  for  interaction  is  then 
computed  by  multiplying  the  squared  difference  between  the  observed  subclass 
mean  and  the  expected  subclass  mean  by  the  number  of  observations  in  the  sub- 
class and  accumulating  over  all  subclasses.   This  method  of  computing  the  sxxm. 
of  squares  for  the  highest  order  interaction  when  constants  for  all  other  ef- 
fects among  the  smiallest  subclasses  have  been  fitted  is  easily  extended  to  more 
complex  analyses.   However,  it  should  be  pointed  out  that  no  method  is  avail- 
able for  obtaining  unbiased  tests  of  significance  for  two  or  more  interactions, 
unless  constants  are  fitted  for  all  interactions  except  one.   Even  then,  if  the 
test  of  significance  for  the  interaction  omitted  (usually  the  highest  order  in- 
teraction) is  significant  and  constants  were  not  fitted  simultaneously  for  this 
interaction  the  estimates  of  constants  for  all  other  effects  are  biased. 

The  appropriate  error  sum  of  squares,  for  use  in  testing  the  significance 
of  the  BC:A  interaction  omitted  in  the  least-squares  analysis  undei:  model  (5), 
is  computed  from  y2  n   v 

2  ijk       >5,  °iik  ijk, 

ijkl   ^J^^        ijk     ijk  ijkl     ^^^^   ^J^^       ijk       \jk 

-^ I  /  /^  ilk  iik^ 

-  g'dlZZZ  '^iiiciy.ikl   ■  ^^       n       ^ 
ijkl      ^^'^^   ^^^^        ijk       ^ijk 

where  d'  and  g'  are  the  partial  regressions  of  y  on  D  and  G  computed  on  a  with- 
in ABC  subclass  basis. 

The  sum  of  squares  for  BC:A  interaction  can  most  easily  be  computed 
directly  from 

R[a*a^,b^^,c^,(ac)^^,(bc)^^^,d,g]  -R[a,a^,b^j,Cj^,(ac)^j^,d,g) 

where  Y^ 

ncc,a^,h^^,c^,(ac)^^,ihc)^^^,d,g]   -SSZ  -^  +  R(d',g'), 

rather  than  by  the  indirect  method  of  Hazel. 

The  error  sum  of  squares  when  the  BC:A  interaction  does  not  exist  is  com- 
puted from         2      „r     ,       /   y    ,   1 
l^^ijkl  -  RC«,^i,b^j,c^,(ac)^^,d,gI 

where  the  total  reduction  in  sum  of  squares  due  to  fitting  all  constants  is 
computed  by  sunaning  the  products  of  the  constant  estimates  and  the  correspond- 

ing  RHM's  in  the  usual  manner.   The  mean  square  resulttnp:  from  dividing  th^e 

9_/   Hazel,    L.   N,      The   covariance  analysis   of  multiple   classification   tablcf?.  with 
unequal   subclass  numbers.      Biometrics   Bulletin   2:21-25.      1946. 
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sum  of  squares  obtained  in  this  way  by  the  degrees  of  freedom  for  error  is 
biased  upwards  if  the  BC:A  interaction  exists.   Hence,  the  error  mean  square 
used  by  Hazel  to  test  the  significance  of  his  two-factor  interaction  was  in- 
appropriate and  would  have  been  biased  upwards  if  the  interaction  had  existed. 

The  least-squares  (or  adjusted)  sums  of  squares  for  A,  B:A,  C,   AC,  re- 
gression d  and  regression  g  may  be  computed  by  the  general  direct  method, 
B'Z~1b,  previously  described,  when  the  inverse  of  the  complete  variance- 
covariance  matrix  is  available.   The  adjusted  sums  of  squares  for  those  sources 
of  variation  which  have  only  a  few  degrees  of  freedom  can  almost  always  be 
computed  most  easily  by  this  direct  method.   In  many  cases,  however,  when 
model  (5)  is  applicable  there  will  be  a  considerable  number  of  degrees  of  free- 
dom for  B:A  as  compared  with  each  of  the  other  sources  of  variation.   In  this 
case  it  may  be  somewhat  easier  to  compute  the  sum  of  squares  for  B:A  by  the 
indirect  procedure,  i.e.,  from 

R^,a^,b^j.,.c^,(ac)^j^,d,g]   -  R{a,a^,c^,(ac)^^,d,gJ  . 

The  decision  concerning  which  procedure  to  use  in  each  case  is  largely 
determined  from  the  rank  of  the  matrix  inverse  segmcnt(s)  that  must  be  invert- 
ed if  the  direct  method  is  used,  as  compared  with  the  number  of  equations 
which  must  be  solved  if  the  indirect  method  is  used.   For  example,  if  p  «  2, 


q^  -  10,  qj 


15,  and  r  »  3  the  indirect  method  should  definitely  be  used  to 


compute  the  sum  of  squares  for  B:A,  since  it  would  require  the  solution  of 
only  8  eqixations.   The  direct  method  would  require  the  inversion  cf  one  9x9 
matrix  and  one  14x14  matrix  as  well  as  the  matrix  multiplications,  B'Z~-1-B,  to 
obtain  the  same  stim  of  squares. 

Least-Squares  Means,  Standard  Errors  and  Individual  Comparisons 

When  all  effects  are  fixed,  except  for  the  errors,  the  investigator  is 
primarily  concerned  with  the  least-squares  means  with  their  standard  errors 
and  tests  for  the  significance  of  contrasts  among  the  means.   The  least- 
squares  means  are  constiructed  from  the  constant  estimates,  e.g.,  under  model 

(5)      <s    ^    /s  _    ^  _ 

H-a+dD+gG----------  Overall  mean. 

|x  +  Hi  ---------------  Means  of  the  A  classes. 

\i  +  Sij^  +  h^x    ------------  Means  of  the  AB  subclasses. 

^^"'■Ck  ---------------  Means  of  the  C  classes, 

\i.  +  ^    +Cj^  +  ^c)^j^ Means  of  the  AC  subclasses. 

In  each  case,  these  means  estimate  the  class  or  subclass  arithmetic  mean  that 
would  be  expected  if  a  new  set  of  data  was  taken  in  the  same  manner,  except 
that  equal  subclass  numbers  would  be  achieved  and  the  mean  of  the  Dj[jkl  and 
Gijkl  fo'^  sll  classes  and  subclasses  would  equal  5  and  G.   When  unequal  fre- 
quencies and  variations  in  the  means  for  D  and  G  are  a  result  of  chance  rather 
than  a  result  of  the  effects  being  considered,  these  means  (which  are  referred 
to  here  as  the  least-squares  means)  are  the  ones  desired.   Unequal  weighting  of 
the  individual  effects  in  obtaining  expected  means  may  sometimes  be  desirable. 
For  example,  if  the  a   represent  sire  effects  and  the  c   represent  age  of  dam 
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effects  one  could  logically  be  interested  in  computing  the  expected  sire  means 
when  each  sire  is  mated  to  different  numbers  of  dams  of  the  different  age 
classes.  In  this  case,  ^    ^   ^  _.   ^  _      ^ 

^i,»  =a  +  dD+gG+Z  Wj^Cj^ 

k 

•where  the  w,  are  the  veights  to  be  given  each  age  of  dam  effect  based  on  the 

number  for  each  age  of  dam  class  that  is  to  be  considered  and  2  wj^  =  1,  The 

variance  of  ^x'  +  aj^  "would  be  different  from  the  variance  of  ^x  +  3^,   of  course, 

but  could  still  be  computed  from  the  inverse  elements  of  the  matrix  inverse  by- 
making  use  of  the  general  formula  for  computing  the  variance  of  a  sum. 

Standai'd  errors  for  least-squares  means,  or  for  differences  among  the 
constant  estimates  within  a  set,  are  computed  from  the  inverse  elements  and  the 
estimate  of  of  in  the  usual  manner  that  has  previoiisly  been  described. 

Duncan's  Multiple  Range  Test,  as  modified  by  Krams2;±2/may  be  used  to  make  pair- 
wise  comparisons  or  tests  of  significance  for  orthogonal  comparisons  can  be 
completed.  Methods  for  completing  these  tests  in  least-squares  analyses  with 
fixed  effects  were  also  described  in  previous  sections. 

Estimation  of  Variance  Comronents 

When  all  effects  in  model  (5)  are  random,  except  a,  d  and  g  the  expec- 
tations of  the  mean  squares,  E(MS),  obtained  with  a  complete  least-squares 
analysis  are  as  follows: 

d.f.  E(MS) 


A 

P-1 

^  -*-   ^o|c  ■*•  k8og 

B:A 

s-p 

<  ^   ^6^:a 

C 

r-1 

<   -^  ^J.4c  "  ^5^ 

AC 

t-p-r+1 

<^  -  ^3^0 

Regression  d 

1 

-|^^2^ 

Regression  g 

1 

^  *  ^l^g 

Error 

n.«,-s-t+p+3 

<4 

These  expectations  point  out  that  exact  tests  of  significance  are  available  for 
each  of  the  two  regression  coefficients,  the  AC  interaction  and  the  B:A  effects. 
Since  k^,  kL,  and  ky  are  aU  different  coefficients,  with  unequal  subclass  num- 
bers as  are  klso  k^  and  kg,  no  exact  test  of  significance  is  available  for  the 
effects  of  A  or  C.  If  the  interaction  of  A  and  C  is  non-existent  the  error 


10/  Kramer,  C.  Y.  IS'57.  Ibid. 
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mean  square  is  the  appropriate  mean  square  for  testing  C  effects o  However, 
since  each  Bxxm.   of  squares  in  the  complete  least-squares  analysis  has  been 
adjusted  for  possible  effects  of  all  other  sources  of  variation  considered 
this  may  give  an  inefficient  test  depending  largely  on  the  amount  of  unequal 
frequencies  existing  in  the  data.   In  fact,  if  any  of  the  effects  included 
in  the  model  are  really  non-existent  all  tests  of  significance  and  estimates 
of  variance  components  are  less  precisely  determined  than  would  be  the  case 
if  such  effects  are  omitted  from  the  model  and  the  data  reanalyzed,  A  short- 
cut matrix  multiplication  procedure  for  use  in  deleting  sets  of  effects  is 
presented  in  the  next  major  sub-section  so  that  more  precise  results  can  be 
obtained* 

Since  the  effects  of  the  regressions  are  fixed  there  is  no  reason  for 
computing  k  and  k-.  All  other  k's  are  needed  in  order  to  obtain  the  es- 
timates of  the  variance  components.  The  coefficients  k-,  k_,  k,  and  k_  may 

be  obtained  by  the  direct  method,  as  previously  described,  at  the  same  time 
the  sums  of  squares  are  computed o  However,  if  the  number  of  B  classes  in 
the  different  A  classes  is  large  relative  to  the  number  of  A  and  C  classes 
it  will  be  best  to  compute  k,  and  the  sum  of  squares  for  B:A  by  the  indirect 

0 

method.  The  remaining  coefficients  k, ,  k_  and  k„  can  only  be  obtained  by  the 

'f    /       o 

indirect  method  at  present  if  any  of  the  n.  .  ■  0.  If  all  n.  ,^0,  k,  ■ 

m  a  i.lC  X.K4 

—  k_  and  k_  ■«  —  k-.  Even  though  all  n.  ,  >0,  k-,  must  still  be  computed  by 
the  indirect  method. 

Mixed  Model 

In  many  problems  the  a.  and  b.   effects  are  random  and  the  c,  effects, 

as  well  as  the  regressions,  are  fixed.  For  this  case,  Hendersonii./  has 
presented  a  short-cut  method  (  referred  to  by  him  as  Method  2)  of  computing 
unbiased  estimates  of  the  variance  components  when  p  and  the  q  are  large. 

The  method  is  also  applicable  when  all  effects  are  random. 

Method  2  of  Henderson  gives  precisely  the  same  estimates  for  the  vari- 
ance components  that  would  be  obtained  with  the  complete  least-squares 
analysis.  Hence,  the  method  gives  unbiased  estimates  and  provides  accurate 
adjustment  for  the  fixed  effects. 

Using  Method  2  of  Henderson,  the  analysis  of  a  set  of  data  under  model 

(5)  when  the  a.,  the  b . ,  and  e. ,,  ,  are  random  and  all  other  effects  are 
1'      ij      Ijkl 

fixed,  may  be  completed  with  the  following  steps: 


11/  Henderson,  C.  R.  1953,  Ibid. 
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step  1«   Absorb  the  equations  for  a  +  a^  +  bj_4  into  the  eqiiations  for 

the  cy-f   the  (ac)jjj.,  d  and  g  in  the  manner  described  above.  In 

general,  matrix  notation  for  any  analysis  this  step  is  accomplished 
by  computing  the  S  -  N»D-1n  and  the  Y2  -  K'D~1Y3_  values. 

Step  2,   Impose  the  restrictions  (or  constraints)  on  the  constant  esti- 
mates so  that  Z  C|^  =  2(ac)^  »  2(ac)^j^  =  0  and  complete  the 

k     i        k 
appropriate  subtractions  and  additions  by  columns  and  rows  among  the 
"new"  coefficients  and  "new"  right  hand  members. 

Step  3«   Invert  the  reduced  coefficient  matrix  for  the  cj^,  (ac)^]^,  d  and 
g  and  compute  the  estimates  for  these  constants  e3+-her  from  a  direct 
solution  of  the  equations  or  from  C"^  Of2-N'D"^yx)  ;  where  C"-^  is  the 
matrix  inverse  and  (Y2-N'D~1Yi)  refers  to  the  right  hand  members. 

Step  U.   Compute  the  least-squares  sums  of  squares  for  C,  AC,  Regression 
d.  Regression  g  and  Error,  The  eri'or  sum  of  squares  is  computed 
from 

Error  S,  Sqs.  =  2ZZZyj_^j^i  -  R  [a,ai,bij,Ck,  (ac)jj^,d,g) 
ijkl 

T?. 
-  2ZZZy  2   -  ZZ  j^  -  B'(y2  -  ^^B'^Y-^) 
i;3kl  -^  ij  ^^*    2 

where  Bl   is  a  row  vector  of  the  cj^, (ab)2j^,  d  and  g  and  Yp  -  N'D'^Y-, 

is  a  column  vector  of  the  corresponding  RK^!•s  in  the  reduced  equa- 
tions • 

In  most  cases,  the  least-squares  sums  of  squares  for  C,  AG  and 
the  two  regressions  can  most  easily  be  computed  by  the  general 
direct  method,  B'Z^l-B,  as  previously  described. 

Step  $,   Set  up  the  analysis  of  variance  for  the  fixed  effects  and  com- 
pute the  F  values  for  test  of  significance  as  indicated  below: 

SoiiTce  of  Variation 

c 

AC 
Regression  d 

Regression  g 

Error 

If  the  interaction  of  A  and  C  is  likely  to  be  non-significant  and 
small,  one  should  first  compute  the  sums  of  squares  for  error  and 
the  AC  interaction.  When  the  F  test  clearly  indicates  that  the 
(ac)jji^  effects  are  near  zero,  the  (ac)jj^  effects  can  be  deleted 
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r-1 

E(MS) 

t-p-r+1 

<4  ■"  ^y^c 

1 

o|  +  k20§ 

1 

^a  *  ^l^g 

n«..-s-t+p+3 

^ 
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before  completing  the  test  of  significance  for  the  fixed  effects,  A 
short-cut  procedure  for  use  in  deleting  a  set  of  effects  is  given  in 
the  next  sub-section. 

No  exact  test  of  significance  exists  for  the  c^   fixed  effects 
when  the  aj_  effects  are  random  and  the  (ac)j^j^  do  not  equal  zero. 

However,  the  c-^   are  unbiased  estimates  of  the  effects  of  the  C 

classes  and  may  therefore  be  used  to  adjust  either  these  data  or 
other  data  of  a  similar  nature.  If  the  AC  interaction  is  found  to 
be  non~significant  and  small  and  is  then  deleted  from  the  analysis, 
an  exact  test  of  significance  can  be  made  for  the  cj^  effects.  In 
this  case,  mean  separation  procedures  for  pairvTise  comparisons  or 
tests  of  significance  for  orthogonal  contrasts  among  the  cy^   may  be 
completed  too  as  has  previously  been  outlined. 

Step  6»   Compute  the  adjusted  totals /I! »  ^  for  the  AB  sub-classes  as 

^  i^* 
follows : 

^ij.  =  ^ij.  "  i^Jk  ^k  -  ^iak(^"^)ik  -  d  Dij,  -  i  Gij, 

The  general  formula  in  matrix  notation  for  computing  the  adjusted 
totals  for  the  classes  or  subclasses  absorbed  is  Y' •  =  Y  -  NB«, 

The  adjusted  totals  for  the  A  classes  are  then  computed  by  summing 
the  Y!'.  over  j,  i.e., 

Yii  =  2  Y»» 
Step  7«   Compute  the  "adjusted"  sums  of  squares  for  A  and  B:A  and  the 

eTms). 

The  adjusted  sums  of  squares  are  computed  as  follows: 

y!i2   Yit2 


Adjusted  S.  Sqs.  -  -  A»  =  Z  —- 

i  ^t* 


•  •  • 

I... 


^ 


2  2 

Y'l     YM 
Adjusted  S.  Sqs.  -  -  B«:A'  -  2Z  -=^  -  Z   —^  . 

ijHij.     .Hi.. 

These  adjusted  sums  of  squares  for  A  and  BtA  will  not  be  the  same 
as  the  least-squares  sums  of  squares  since  this  procedui^e,  in 
effect,  assumes  that  the  adjustments  for  the  effects  of  c^,    (ac)^]^ 

d  and  g  are  made  without  error.  Since  the  cj^.,  (ac).y^,  d  and  g  are 

measured  with  error,  the  expectations  of  the  mean  squares  for  A' 
andB»:A.  are   ^^ 

A'     p-1     ^12^  *  ^:a*  ^ll;^ 
B«  :A'   a-Tp     ^10°^  *  ^ll°b:a 
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The  unusual  feature  of   these  expectations  of  mean  squares  is  the 
coefficient  for  o|  in  both  the  A'  and  the  B'  :A'  mean  squares.  Usu- 
ally this  coefficient  is  unity  in  each  case,  but  since  k-,Q  and  k-j^ 

are  not  imity  it  is  necessary  to  computa  them  in  order  to  obtain  un- 
biased estimates  of  a.  ^  and  o^.  In  addition,  it  is  necessary  to 

compute  k-Q,  k-,,,  and  k^i  .  These  last  three  coefficients  are  com- 
puted by  the  standard  procedure  for  computing  coefficients  of  vari- 
ance components  with  lonequal  frequencies  in  the  nested  classifi- 
cation, i.e.. 


■^13 


>=u, 


The  calculations  required  in  the  computation  of  k,-  and  k,2  are 
somewhat  more  involved  than  in  the  computation  of  ^jl*   ^13  ^^^  ^Ik* 
However,  at  this  stage,  many  of  the  values  required  to  compute  k-.-. 
and  k-jp  will  have  been  computed  during  the  absorption  of  the 
a  +  aj_  +  bj_^  equations  and  the  inversion  of  the  reduced  matrix, 
since 


i.^^zzci%.^.P,^^- 


ki2  - 


The  C-^J  are  the  inverse  elements  in  the  matrix  inverse  to  the 
reduced  least  squares  coefficient  matrix  after  absorption.  The  P* » 

elements  are  the  values  obtained  in  the  absolution  of  the  a+a^+bj_4 
equations,  i.e.,  the  N^D'-'-N  values;  the  P*  are  the  values  obtained 

from  the  N»D  tI  multiplication  if  the  h^^   effects  are  deleted  from 
the  model  and  only  the  a  +  aj^  equations  were  to  be  absorbed;  and  the 
P^j  are  the  values  obtained  from  the  N^D"^  multiplication  if  both 
the  a^  and  b^j  effects  are  deleted  from  the  model  and  only  the  a 
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II 


I 


1 


equation  is  absorbed.  For  example. 


J"!!- 

^n 

^12' 

^1^ 

n.  j-n.  -rt 
^^     i.jl  ij2 

etc. 

1  °1.. 

etc. 

pi     « 
12 

i    "i.. 

etc. 

The  C  and  P  matrices  must  all  be  of  the  same  sisse.  In  this 
case,  the  order  of  these  matrices  may  all  be  either  (r+t+2)  X 
(r+t+2)  or  (t-p+2)  X  (t-p+2),  depending  on  whether  the  inverse 
elements  for  the  columns  and  rows  deleted  as  a  result  of  imposing 
restrictions  are  computed  and  added  back  into  C,  or  whether  the 
same  subtractions  and  additions  are  completed  in  the  P  matrices  as 
was  done  in  the  S-N'D~1n  matrix.  Both  procedures  will  yield  the 
same  results*  In  fact,  these  subtractions  and  additions  by  columns 

and  rows  in  the  P  matrices  may  be  completed  in  the  P! '  -  P'  and 

ij    ij     . 
the  P! .  -  P. ,  matrices,  if  desired. 

Elimination  or  Iteletion  of  a  Set  of  Effects 

If  the  test  of  significance  for  the  AC  interaction  described  above  indi- 
cates that  the  (ac)jj^  effects  probably  equal  zero  the  inverse  elements  associ- 

ated  with  the  Cj^,  d  and  g  may  be  adjusted  by  a  deletion  procedure.  The 

ad;5usted  inverse  elements  are  the  inverse  elements  that  would  be  obtained  if 
all  equations  for  the  (ac)j^j^  are  deleted  in  the  reduced  matrix  and  the  matrix 

of  coefficients  for  the  cj^,  d  and  g  was  inverted  in   the  usual  manner.  The 

adjusted  inverse  elements  are  computed  with  the  following  matrix  arithmetic: 

where  C^-*-  is  the  matrix  of  inverse  elements  associated  with  the  Cj^,  d  and  g  in 

the  C"!  matrix;  Z"-'-  is  the  inverse  of  the  square  segment  of  C~l  corresponding 
AC 
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to  the  (ac)-itr.  and  R  ±3   the  matrix  of  inverse  elements  which  associate  the 
(ac)jj^  with  the  cj^,  d  and  g. 

Since  ZTi  is  used  to  compute  the  least-squares  sum  of  squares  for  AC,  the 
AC 

additional  labor  required  to  obtain  the  new  inverse  matrix  is  often  small  in- 
deed as  compared  with  the  direct  inversion  of  the  matrix  of  coefficients  for 
the  cic,  d  and  g,  VJith  other  models  the  difference  in  time  required  to  compute 

the  reduced  inverse  matrix  is  likely  to  be  considerably  greater  than  with  the 
particular  model  under  discussion. 

The  estimates  of  the  cj^,  d  and  g  unadjusted  for  the  unequal  frequencies 

associated  with  the  AC  interaction,  may  be  obtained  by  multiplying  the  new  in- 
verse elements  by  the  appropriate  RHM's«  Sims  of  squares  for  C,  Regression  a 
and" Regression  g  are  computed  by  the  direct  method  from  the  B'Z~1b  multipli- 
cation using  the  new  constant  estimates  and  the  new  inverse  elements.  Also, 
the  sum  of  squares  for  error  must  be  recomputed,  since  the  degrees  of  freedom 
used  for  the  AC  interaction  must  now  be  placed  into  error. 

Numerical  ExamiDle 

The  average  daily  gain  (iDG)  for  each  of  65  steers  of  the  Hereford  breed 
c»rranged  according  to  line,  sire  and  age  of  dam  is  given  in  the  table  below. 
The  age  at  weaning  and  initial  weight  at  the  beginning  of  the  test  feeding 
period  is  also  given.  All  of  these  steeirs  were  fed  for  the  same  length  of  time 
in  the  feed  lot.  Identification  numbers  have  been  assigned  for  line,  sire  and 
steer  for  covenience.  In  addition,  the  original  data  have  been  rearranged,  for 
illustrative  purposes. 


Age  of 

steer 

Initial 

Average 

Line  No, 

Sire  No,           Dam 

No, 

kflQ 

Weight 

Daily  Gain 

1 

1                3 

1 

192 

390 

2,2]^ 

2 

25U 

li03 

2,65 

h 

3 

185 

U32 

2, la 

li 

183 

167 

2.25 

5-tjp 

5 

186 

W3 

2.58 

6 

177 

k69 

2,67 

7 

177 

h2Q 

2,71 

8 

163 

k39 

2,li7 

2                  h 

9 

188 

h39 

2,29 

10 

178 

U07 

2.26 

5-up 

11 

198 

198 

1.97 

12 

1^3 

k^9 

2.1ii 

13 

186 

h59 

2,lili 

lii 

175 

375 

2.52 

15 

171 

382 

1.72 

16 

168 

hn 

2,75 

3                3 

17 

15U 

389 

2,38 

h 

18 

18U 

klh 

2.U6 

5-Tjp 

19 

17U 

U83 

2.29 

20 

170 

U30 

2,30 
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Line  No,       Sire  No, 


Age  of 

steer 

Initial 

Average 

Dam 

No. 
^1 

if 

Weight 
1X3 

Daily  Gain 
2.9U 

3 

22 

358 

381 

2.50 

23 

158 

365 

2.UU 

U 

2U 

169 

386 

2,UI; 

25 

2hh 

339 

2.15 

^•-up 

26 

2$9 

ia9 

2.5U 

27 

152 

h69 

2.71* 

28 

11*9 

376 

2.50 

29 

li49 

375 

2.5U 

3 

3C 

189 

395 

2,65 

h 

31 

187 

Uii7 

2.52 

32 

165 

U30 

2.67 

5-up 

33 

181 

li53 

2.79 

3U 

177 

385 

2.33 

35 

151 

lOl* 

2.67 

36 

ll;7 

353 

2.69 

h 

37 

18U 

lOl 

3.00 

38 

18U 

ii20 

2.it9 

5-up 

39 

187 

U27 

2.25 

UO 

18U 

h09 

2.1*9 

la 

183 

337 

2.02 

U2 

177 

352 

2.31 

3 

lt3 

205 

U72 

2.57 

iOi 

193 

3U0 

2.37 

li 

h^ 

162 

375 

2.61* 

5-up 

k6 

206 

U51 

2.37 

hi 

205 

1*72 

2.22 

hQ 

187 

li02 

1.90 

h9 

178 

h6k 

2.61 

50 

175 

UlU 

2.13 

3 

51 

200 

U66 

2.16 

52 

18U 

356 

2.33 

53 

175 

Ul*9 

2.52 

U 

51* 

178 

360 

2.1*5 

5-up 

$^ 

189 

385 

iJih 

^6 

181; 

I131 

1.72 

57 

183 

liOl 

2.17 

3 

58 

166 

Uol* 

2.68 

li 

59 

187 

li82 

2.1*3 

60 

186 

350 

2.36 

61 

18U 

U83 

2.UU 

5-up 

62 

180 

U25 

2.66 

63 

177 

U20 

2.1i6 

6U 

175 

iil49 

2.52 

65 

16U 

U05 

2.1i2 
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The  mathematical  model  underlying  the  analysis  to  be  made  with  these  data  is 
yijkl  -  a  +  gi  +  3ij  *  ^k  +  (ga)ik  +  bAijki  "^  ^ijkl  +  ©ijkl 

i  -  1,  2,  3  k  -  1,  2,  3 

d  -  1,  2,  3,  U        1  =  1,  2,  ...,  6 

wb9re:         yijkl  "  "^^   average  daily  gain  for  the  1'*'^  steer  in  the  k""^  age 

of  dam  class  by  the  j"**^  sire  of  the  1'^^  line,  , 

a  ■  the  theoretical  population  mean  T«3ith  equal  subclass 
frequencies  when  weaning  age  and  initial  weight  both 
are  equal  to  the  absurd  value  of  zero.  The  population 
mean  with  equal  frequencies  when  the  weaning  age  and 
initial  weight  both  are  equal  to  the  average  is 

H  "  a  +  bl  +  dV?, 

gi  -  effect  of  the  i*^  breeding  group  or  line, 

3^5  ■  effect  of  the  j"^h  sire  in  the  1"^^  lino, 

aj^  «  effect  of  the  k*^  age  of  dam  class, 

(ga)jj^  "  interaction  effects  for  line  and  age  of  dam, 

b  »  partial  regression  of  ADG  on  age  at  weaning.  Since  all 
steers  were  placed  on  test  at  the  same  time  this 
regression  coefficient  will  measure  the  effect  of  time 
of  birth  on  ADG  in  the  feed  lot, 

Aijjjj_  ■  age  at  weaning  for  a  given  steer, 

d  -  partial  regression  of  ADG  on  initial  weight, 

Wj^jjjjL  ■  initial  "vreight  for  a  given  steer, 

ej[4jjQ^  ■  random  errors* 

Coraplete  Least-Squares  Analysis 

The  least-squares  equations  are  presented  in  Table  1.  The  reduced  set  of 
least-squares  equations,  resulting  from  imposing  the  restrictions  that  Jg^  " 

i 
ZSjLJ  ■  2i^  -  2(ga)jj^  -  Z(ga)jj^  -  0  and  after  completing  the  appropriate  sub- 

tractions  and  additions  by  columns  and  rows,  is  presented  in  Table  2,  The 
matrix  inverse  to  the  reduced  least-squares  matrix  is  given  in  Table,  3« 
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Estimates  of  the  constants  obtained  from  the  solution  of  the  reduced  set  of 
equations  are  as  follows: 12/ 


a  " 

3.06106 

gl- 

-.08035 

A 

.03730 

A 

=11" 

.09051 

A 

^12  " 

-.10750 

A 

=21- 

-.H5>i7 

A 

S3I     - 

.I6581i 

A 

=32- 

.Olt098 

A 

-.26909 

^1" 

.O69U6 

A 

-.00611 

(ga>ii  - 

-.11005 

(ga)^  - 

.013m 

(^)2i  - 

.02522 

(ga)22  " 

-.08066 

A 

b  - 

-.008l5h 

A 

d  - 

.001969 

Estimates  for  the  overall  mean,  ji,  and  the  constants  omitted  by  the  subtractions 
are  computed  as  folJLows: 

A      A      A.      A. 

p.  »  a  +  bA  +  dW 

-  3.O6IO6  -  (.008l51i)  (176.65)  +  (.001969)  (la6»85) 

-  2.lUai43 


^13- 
S22  - 

^3li" 

A 

a.5   ■ 


^3 
(g*a)i3 

(ga) 


(ga) 

(^) 
(gk) 


23 
31 
32 
33 


(-.08035  +  .03730)  -  .0^305 
(.09051  -  .10750)  "  .01699 
(-.11917)  -  .11917 
(.1658U  +  .Oli098  -  .26909)  -  .06227 
(.069U6  -  .00611)  -  -.06335 

(-.11005  +  .01311;)  =  .09691 
(.02522  -  .08066)  «  .oSShh 

(-.11005  +  .02522)  -  .08ii83 

(.01311*  -  .08066)  -  .06752 

(.081;83  +  .06752)  -  -.15235 


12/  Several  extra  decimal  digits  are  given  so  that  sums  of  squares 
compuiied  later  will  be  more  accurate. 
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Analysis  of  Variance 

A  test  of  significance  for  the  possible  interaction  of  sires  with  age  of 
dam  (SA:G)  will  first  be  completed.  The  least-squares  equations  for  b'  and  d' 
on  a  within  line  X  sire  X  age  of  dam  subclass  basis  are  as  follows: 

4,830„38  9'  +  8,046.82  d'  —19.5927 

8,046.82  6'  +  68,016.78  d'  -  73.5537 

The  estimates  of  b'  and  d'  are  obtained  by  solving  these  two  equations. 

b'  -  -.007295 

d'  "   .001945  :"^. 

The  error  sum  of  squares  for  testing  the  significance  of  the  SA:G  interaction 

^. 

2EIX  yf ,,    -  2:22  -IJ^  _  R(b',d') 
.        ijkl     ^^^       ijk  ^jk 

-  2.2673  -  (-.007295)(-19.5927)  -  (.001945)(73.5537) 

-  2,2673  -  .2860  -  1.9813 

The  GSA  observed  and  expected  subclass  means  are  given  in  Table  4.  The  ex- 
pected means  are  computed  by  first  adjusting  the  subclass  total  (for  Y)  with 
the  constant  estimates.  For  example,  the  expected  subclass  mean  for  line  1, 
sire  1  and  age  of  dam  3  is 

•|[4.89  +  .08035  -  .09051  -  ,06946  +  ,11005  + (.008154) (173.00  -  176.65) 

-  (.001969)(396.50  -  416.85)] 

-  2.4654 

The  sum  of  squares  for  the  SA:G  interactioi;i  is  computed  from  Table  4  as 
follows: 

S.  Sqs.  -  -  -  SA:G  -  (2)(-.0204)^  +  (2)(.0059)^  +  +  +  (4)(-,0183)^ 

-  .4890 

This  sum  of  squares  may  be  more  easily  computed  from  330.6325  +  .2860 

-380.3931  -  ,5254.  Rounding  errors  probably  account  for  the  apparent  , 

descrepancy  in  the  two  methods. 

The  test  of  significance  for  the  SA:G  Interaction  may  now  be  completed  as 
follows: 

Source  of  Variation  d.f .   S.Sqs.  M.S. 

SA:G  10    .4890  .04890  n.s. 

Error  38   1.9813  .05214 
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Table  h*   Observed  and  expected  GSA  subclass  means 


I*ine 

Sire 
No. 

Age  of 
Dam 

^ijk 

Means 

No. 

ObservBd 

Expected 

Obs.-Exp. 

1 

1 

3 

2 

2.1iJ450 

2.U65It 

-.020U 

1 

1 

U 

2 

2.3300 

2.32U1 

.0059 

1 

1 

5 

It 

2.6075 

2.5761 

.031it 

1 

2 

k 

2 

2.2750 

2.3852 

-.1102 

1 

2 

S 

6 

2,2567 

2.28U6 

-.0279 

1 

3 

3 

1 

2.3800 

2.35ia 

.0259 

1 

3 

It 

1 

2.1i600 

2.5819 

-.1219 

1 

3 

5 

3 

2.5100 

2.U815 

.0285 

2 

U 

3 

2 

2.U700 

2.U307 

.0393 

2 

U 

It 

2 

2.2950 

2.3507 

-.0557 

2 

U 

5 

It 

2.5800 

2.5562 

.0238 

2 

5^ 

3 

1 

2.6500 

2.5U26 

.107U 

2 

5 

li 

2 

2.5950 

2.5362 

.0588 

2 

5 

5 

U 

2.6200 

2.5138 

.1062 

3 

6 

It 

2 

2.7ii50 

2.6MT 

.1039 

3 

6 

5 

It 

2.2675 

2.2992 

-.0317 

3 

7 

3 

2 

2,li700 

2.1;526 

.017U 

3 

7 

U 

1 

2.6UOO 

2.lt575 

.1825 

3 

7 

5 

5 

2.2ii60 

2.2851 

-.0391 

3 

8 

3 

3 

2.3367 

2.3831 

-.0U6U 

3 

8 

U 

1 

2.it500 

2.7376 

-.2876 

3 

8 

5 

3 

1,7767 

1.95U8 

-.1781 

3 

9 

3 

1 

2.6800 

2.3589 

.3211 

3 

9 

U 

3 

2.iil00 

2.36U8 

.OU52 

3 

9 

5 

U 

2.5150 

2.5333 

-.0183 

i-r  y   ( 

.Oii890 

^    is    Tfi" 

33    ■hhrsT! 

on<a.    l-ndi 

natino"   r.lo 

a-rlTT  that  i 

The  ratio  f or  F  (      ,  _  —.^ „ ,  __  __ 

.052Ui  .         «     -^ 

interaction  of  sire  by  age  of  dam  within  lines.  The  analysis  of  variance  can 
now  be  set  up  to  test  the  significance  of  the  effects  for  which  constants  have 
been  fitted. 

The  sums  of  squares  are  computed  from  the  sub-inverse  matrices  and  the 
constant  estimates  by  the  general  direct  method,  B'ZB,  as  follows: 


S.S^84— G  -    ^.08035     .03730] 


(30. 

f  .08035     .03730]  M 
[lit. 


Oh6831    -.029lt01 


-  tl*90385  -.29963] 

-  .litis 
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5^!!i^!S5"' 


rjBP" 


rrrrrsap^'t 


I.Sqs.-- S:G  -    fo905l     -.lOT^o] 


.098039     -.Oia730\  "^ 


Bi-^ 


(-•11917)^ 


■k0U738       .1097261  "  .079911 

r  J"  >13li969  -.Ola78U  -.0^2016]  "^ 

Ll6581t     .0ii098     -.2690^-,OUl781i     .10U^60  -.028235  B-. 

^  ^.052016  -.02t5235     .1200liJAJ  ^ 


•13528  +  .17772  +  .621*78 
.9378 

.06llli    -.03li39lil  "^ 


S.Sqs.—A  =  Co69l*6    -.00611]  Ba 

*■  |-.03U39li     .Oli5l2it 

-  .12U7 

S.Sqs.— GA  -E-.11005     .01311;     .02522     -.08066) 

".132501  -.079153  -.080915  .OU5079 

-.079153  .093828  .Ol;8381i  -.056551i 

-.080915  .OU838U  .126387  -.07li6U3 

.Oli5079  -.056551*  -.07ii61i3  .098938 


-1 


B, 


GA 


-  .l;65l 

S.Sqs.-b  .     1-22S^^  .   .31,82 
.00019093 


S.Sqs.— d  " 


(.001969)' 

iooooi5o3 


.2579 


S.Sqs. — ^Error  -  ?22Z  yiiid  -  R(all  constants) 


382.8998  -  (3.06106)  (156.7U)  -  (-.08035) (-17.69)  

-(.001969)<65,lt28.1}li) 
382.8998  -  380.3931 
2.5067 
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The  analysis  of  variance  table  may  now  be  set  up  as  follows: 


Source  of  Variation 

d.f. 

o.Sqs. 

Lines   {Q) 

.lidB 

Sires  (S:G) 

6 

•9378 

Ages  of  Dam  (A) 

2 

.121+7 

GA 

k 

.1+651 

Regression  b 

1 

.3li82 

Regression  d 

1 

.2579 

Error 

ha 

2*5067 

M,S. 

F 

.076^ 

<\ 

.1563 

2.99^ 

.0621; 

1.20 

.1163 

2.23 

.31+82 

6,67* 

.2579 

h.9k^ 

.0522 

The  tests  of  significance  assTime  that  the  effects  of  sii^s  and  eiror  are  random 
and  that  all  other  effects  are  fixed.  The  lack  of  significance  for  the  GA 
interaction  suggests  that  more  accurate  estimates  of  all  other  effects  could  be 
obtained  by  deleting  the  interaction.  The  short-cut  deletion  procedure  will  be 
illustrated  in  the  next  section  with  crossline  data.  The  remaining  analysis 
with  tliis  example  will  assume  that  the  interaction  effects,  (ga)jj^,  exist  and 

must  therefore  be  retained  in  the  analysis. 

Variance  Component  Estimates 

If  the  classifications  of  G,  S:G  and  A  are  regarded  as  random  the  investi- 
gator is  interested  in  estimating  o^,  ^is>   ^  ^^^  *^ea*  '^'^  compute  estimates 
of  these  variance  con^onents  from  "the  complete  least-squares  analysis  it  is 
first  necessary  to  compute  the  coefficients  for  them  in  the  expectations  of  the 
mean  squares.  Since  all  of  the  subclasses  of  G  and  A  are  fitted  and  the  number 
of  degrees  of  freedom  for  G,  S:G,  A  and  GA  ar«  small  it  is  fairly  easy  to  com- 
pute estimates  of  of.-,,  of  and  o^.  The  expectations  of  the  mean  squares  for 

G,  S:G,  A,  GA  and  errors  are  as  follows: 

E(MS) 
G        °l  *  J^7  °|a*  ^8  ^:g*  ^  <^ 
S:G      c^^k^cf.g 

GA       o|  +  k3  a|a 
Error    o| 

The  coefficients  k^,  k^,  k^,  and  k^  may  easily  be  computed  by  the  direct 
method  for  this  particular  analysis.  For  example, 

^  "  \  ^.33781+  +  22.79U76  -  ^(lli,31079  +  lii.31079)] 


J   (38.82181)  -  12, 9U. 
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The  values  for  ko,  kd   and  k^,  when  computed  in  a  similar  manner  from  the 
inverse  matrices  used  to  compute  the  siira  of  squares  directly,  are 

k3  »  6.5*67 
k^  -  15.1*38 
k^  -  6.28U 

Since  all  GA  subclasses  are  filled,  the  coefficients  for  o^^^,  kr  and  ky,  are 
computed  as  follows: 


kj  -  ^k^  -  h.31h 


The  only  method  available  at  present  to  compute  ko  is  the  indirect  method  of 

Henderson,  which  requires  the  inverse  of  the  1^x1^  least-squares  coefficient 
matrix  when  the  g.  are  omitted  from  the  model  as  well  as  the  associated  suns 

matrix  of  the  same  order.  Since  the  indirect  method  was  illustrated  with  the 
numexacal  example  of  the  last  section,  kg  will  not  be  computed  for  this 

analysis.  The  estimates  of  o^,  of,  and  of.g  are  computed  as  follows: 


d.f. 
6 

Mean  Squares 

.1563 

S(MS) 

S:G 

o|  +  6.281;  cg.g 

A 

2 

.062U 

of  +  S^hl?   o|a  + 

GA 

k 

.1163 

o|  +  6,967  o|^ 

Error 

hQ 

.0522 

<4 

°l 

=  .0522 

g|   -  -.0026 

°i 

»  .0092 

<^:g-  -01^ 

I5.ii38  of 


Method  2  of  Henderson 

Frequently  with  animal  breeding  data  such  as  these,  the  g.  and  s- j  will  be 

genetic  classifications  for  which  variance  component  estimates  are  desired. 
All  other  effects  in  the  model,  except  the  errors,  will  be  fixed,  V/hen  the 
numbers  of  degrees  of  freedom  for  the  genetic  classifications  are  adequate  the 
complete  least-squares  analysis  will  usually  be  impractical.  The  modified 
least-squares  analysis  developed  by  Henderson,  which  provides  exact  least- 
squares  tests  of  significance  for  fixed  effects  when  no  interactions  exist 
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among  fixed  and  random  effects  and  unbiased  estimates  of  the  variance  com- 
ponents, will  now  be  illustrated  with  the  present  set  of  data. 

Step  1>  Absorption  of  the  cc+gj^+Sij  equations* 

The  N^D^^N  values  are  presented  in  Table  5  and  the  S-N*D"-^1T  values  (the 
reduced  matrix)  are  presented  in  Table  6.  These  values  were  computed  from  the 
original  least-squares  equations  in  the  same  manner  as  was  illustrated  with  the 
numerical  example  of  the  last  section. 

Step  2.  Imposing  the  restrictions  that  Z  aj^  =  j(ga)jj^=  2(ga)3j^  -  0. 

The  reduced  set  of  equations  for  the  aj^,  (ga)jj^5  b  and  d  after  completing 
the  appropriate  subtractions  by  columns  and  rows  is  presented  in  Table  ?• 

Step  3*  Inversion  of  reduced  matrix  and  calculation  of  constant  estimates*. 

The  inverse  of  the  reduced  matrix  for  the  aj^,  (ga)^,  b  and  d,  (C),  is  the 

same  as  given  for  this  segment  in  Table  3»  The  estimates  of  these  constants, 
obtained  from  solving  the  reduced  set  of  equations,  are  identical  with  those 
obtained  vrhen  the  complete  matrix  was  solved. 

Step  it.  Least-squares  sums  of  squarer^. 

The  sums  of  squares  for  A,  GA,  Regression  b  and  Regression  d  are  the  same 
as  given  in  the  complete  least-squares  analysis. 

Step  5«  Analysis  of  variance  for  fixed  effects. 

The  analysis  of  variance  for  these  effects  is  the  same  as  was  given  for  A, 
GA,  Regression  b.  Regression  d  and  error  in  the  complete  least-squares  analysis. 
When  the  complete  least-squares  analysis  is  not  done  the  error  sum  of  squai^s 
is  computed  from 

If 


as  was  indicated  previously. 


Since  the  g^   and  b^^  are  now  assumed  to  be  random  no  exact  test  of  signifi- 
cance is  available  for  the  aj^  effects. 

Step  6.  Computation  of  the  adjusted  totals  for  the  GA  subclasses  and  the  G 
classes . 

The  adjusted  totals  for  the  line  X  sire  subclasses  are  computed  as  follows: 

Y»«  -  19.98  -  (2)(.069U6)  -  (2)(^006ll)  -  (Ii)(-.06335) 

-(2)(-.11005)  -  (2)(,0131ii)  -  a)(.09691)  -  (-.008l^U)(lla7) 

-113- 


n 

c 
o 

•H 
-P 
as 

CT 
ID 


0) 

+ 
•H 
tt) 

+ 

eJ 
a> 
-p 

V( 

o 

c 
o 


o 

01 

Q 
C 


P 


I 


n 

0) 

P 

iH 
IS 

>• 

0) 
■P 
(0 


ID 
-P 
C 


O) 
E-" 


« 


<T> 


<J3 


<(0 


CM 

<a) 


5 


oo 

CM 


OO 


CM 
CM 


r-t 
CM 


O 


XA 
O 


O 


UN 


CO 


On 
CM 


O 
O 
O 


O 
O 

o 


rH 

o 

CM 

On 

UN 


13    ^    ^ 

OS       irv       NO 

CTn         ^        _3- 


OO 


_3 
CM 


CM 

r-i 
OO 

CM 


ON 

vO 

OO 
OO 


XA 
On 
U\ 
O 


^0 

XA 
O 


On 
CM 


CM 

-3- 


O 


lA 

OO 


8 

o 


o 

CM 


CO 
CN 
CM 


CM 


On 
OO 


CJN 
CO 


o 


f*N  O  t^ 

NO  -3  O 

NO  -3  cn\ 


O 

o 

O 


_3 
* 

-a 

CM 
VA 


lA 
UN 
UN 


_3 

o 


ON 

NO 

03 
OO 


UN 

<^ 

o 


UN 
CO 
CM 


On 
CM 


1^ 
UN 


8     8 

o      o 


o 

8 


01 


UN 


UN 
CJN 
On  C^ 
On  C>v 


Nf)  OO 

On  vO 

r^  On 

UN  ,H 


O 
r»N 


U^ 
O 


R    ^    ^. 


CM 

NO 


OO  0-. 
OO  CTn 
rH         UN 


CM         UN 


O  UN 

r~  OO 

•  • 

CO  r-i 

OO  rH 

OO  r^ 


o 

OO 


NO 


o 

UN 


UN 
NO 


o 


-3  On 

l-H  CM 

O          P-  _3 

UN  rH 

•  • 

l-l  CM 

C>-  -3 

UN  H 

O        CO  t^ 

C~-  O 


««^  CM 

r^  NO 

o^  O 

\A  CO 


\A 

NO 

OO 


o 

CM 


o 
o 


On 
On 


r^ 


NO 


CJ 


ON 

NO 

•  • 

XA         t^ 
CTn         On 

-3  CO 


f»N 


C»N 
On 


NO 


CM    Ux 
f^    OO 

NO    r^ 

t-T  XA 
CM    nO 


CO  <^ 

C^N  NO 

CO 

UN  O 


o 


UN  NO 
<»N  CD 
O         CO 


O 

lA 
O 


(^ 

lA 
On 
UN 
O 


r-t 


CM 


P     CM 
On    iH 


o 

r- 

XA 

OO 

-^ 

o 

CM 

O 

NO 

O- 

• 

• 

• 

• 

* 

r^ 

C7N 

CM 

o 

o 

O 

o 

CN- 

C-- 

n~\ 

r-i 

CM 

On 

nO 

rH 

i-i 

iH 

CM 

_3 

c:j 

CM 

CK 


J3     T3 


0) 

tU) 


(0 


-llli- 


,11,  MMJlWIHIU'IJt.'»P'V^  i...jnii»i.ii" 


>a''ffltWtrr'^ifetf1^i7^.^ifa~^'^-^^'-^ 


I..  ■u^■/H■i^^i.»lll^,l^la.^il^ll^''W'j*'ll'l<wl.»l^^■^.^'W^^Jl^^|||^WlJm■.lW^Wgw.wn^l^|>s 


m»j|iu>Ai|l»lJ|iup«i|PHinii 


s 

pi 


CO 

c 
o 

•H 
-P 
(0 
P 
CT 
V 


+ 

+ 
in 

<^ 
o 

c 
o 


o 

(O 

0) 
01 


T3 
C 


0) 


3^ 

o 

j:: 

u 


I 

O 

I 

CO 


CO 

c 
o 

•H 
+> 
(TI 

3 

cr 

0) 

0) 

<u 

)h 
CO 


I 

(0 

a 

0) 


0) 
(0 


CO 


CM 


<«0 


.(8 
■bO 


[05 


.CO 

•60 


'bO 


'  bO 


>  CO 


cat 


eg 

c  a) 


<(0 


0       vrv       vo 

0 

in 

u> 

-3 

00 

-3 

r- 

CO 

XA 

.  r- 

C? 

_3         VA          (?^         Cn 

1  rH 

0 

0 

NO 

1    NO 

CM 

ro 

^ 

CO 

0 

C 

NO 

NO 

0 

CJ 

CJ 

rH 

rH 

0 

On 

8 

CO 

0 

On         C\J 

rH 

rH 

1  ro         U> 

0 

-3 

1    -=I 

0 

0 

1  ''J^ 

CN 

rH 
1 

1 

1     1 

1 

rH 

i-H 

<V 

1  '^ 

I        1 

On 
CO 

t^      _a 

i-H 

U> 

U> 

0 

P 

? 

Is 

CJ 

CN 

rH 

.    CN" 

f^ 

CO 

CNl 

r- 

0 

1  0 

rH 

U\ 

rr\ 

NO 

0 

1    NO 

rr\ 

d 

r- 

CvJ 

\A 

1  t^ 

CM 

C-- 

CU 

1  ^ 

CO 

>o 

VA 

1  c^ 

rH 

CX3 

r^         rH 

CJ 

.  iH 

_3 

-3 

-3 

1 

rH 

CJ 

.    rH 

rH 

^ 

1 

CJ 

r-i 
1 

1     t 

iH 

' 

1   rH 

On 

CD 

\J> 

m      -LP 

1  0 

VTv 

^ 

0 

1    0 

rr\ 

lA 

CO 

1^. 

r-- 

r>- 

lA 

_: 

rr\ 

00 

vr\ 

<D 

>- 

0 

NO 

ra        U> 

0^ 

_= 

0- 

_^ 

_3 

rr\ 

CO 

en 

C^ 

NO 

On 

rH 

rn 

-3 

OJ 

1 

1  C\J 

1 

CJ 

rH 

1  c^ 

1 

rH 

1 

1 

1  rH 

r- 

On 

vO 

C~- 

CN 

NO 

in 

NO 

CM 

\S\ 

NO 

CJ 

m 

00 

CNJ 

0 

0 

0 

0 

0 

0 

r^ 

CO 

CM 

0 

00 

On 

0 

CO 

On 

• 

• 

• 

• 

• 

« 

rn 

rr- 

NO 

m 

rr\ 

MD 

1 

1 

I 

1 

0 

U\ 

On 

NO 

lA 

rn 

0 

NO 

ro 

0 

lr\ 

_3 

CD 

0 

0 

0 

0 

0 

0 

u^ 

_3- 

0 

CN 

CO 

0 

On 

• 

• 

• 

« 

• 

tH 

_3 

1 

rH 

1 

-3- 

rn 

NO 

t^ 

c^ 

On 

r^ 

lA 

On 

oc 

u^ 

r^ 

0 

0 

0 

0 

0 

0 

CO 

0 

0 

0 

0 

• 

• 

• 

• 

~=1 

• 

1 

-3 

_3 

On 

r-\ 

-3 

On 

.  '^ 

•-i 

C\J 

-3 

r-H 

CM 

1      "^ 

t~~ 

_3 

rH 

0 

0 

0 

C^ 

_3 

rH 

UA 

rH 

r- 

UN 

rH 

1    1^ 

• 

« 

• 

• 

• 

• 

rH 

1 

C\J 

rr\ 

iH 
1 

CM 

1 

1   <^ 

C^ 

NO 

On 

CN- 

NO 

XA 

CD 

CNJ 

\A 

CD 

00 

CJ 

-3 

0 

0 

0 

CO 

CM 

r- 

On 

rH 

c^ 

On 

r 

CO 

CM 

• 

1 

CJ 

r-l 

C^ 

_n 

rH 

C^ 

u\ 

rH 

f- 

U\ 

ao 

r- 

C 

0 

0 

U\ 

r^ 

C-- 

U\ 

r^v 

• 

• 

• 

• 

C\J 

' 

rH 

CJ 

0 
0 

8 

8 

0 
0 

18 

8 

0 

0 

0 

0 

Q 

0 

0 

!-H 

t^ 

NO 

1  rH 

r- 

rH 

1 

1 

-3 

r-i 

1 

1  f^ 

-3 

8 

8 

8 

0 
0 

18 

0 

0 

0 

0 

,  0 

f^ 

CO 

rH 

^- 

Ico 

• 

• 

• 

• 
t 

1  <^ 

8 

0 
0 

0 
0 

8 

c 

0 

0 

0 

ra 

f^ 

NO 

rrj 

• 

• 

• 

• 

CO 

1 

r-t 

1 

CVJ 

rH 

CD 

(Jn 

C^ 

CTn 

NO 

0 

OJ 

n-\ 

CJ 

l-l 

rr\ 

• 

• 

NO 

OS 

u^ 

1 

1 

r-i 

en 

0 

On 

On 

rn, 

^O 

\r\ 

0 

• 

• 

CNJ 

1 

.J 

0 

1 

-3 

00 

•« 

•• 

•• 

•« 

•• 

1  ** 

•  • 

1— 

i             C 

Nl         r^ 

\           r- 

H        CNJ 

m 

rH 

CJ 

rr\ 

rH 

OJ 

<^ 

CO 

CO 

CO 

^  1    .1 

rH 

CM 

CM 

1       CJ 

<^ 

rn 

en 

1  ^ 

T3 

^~N 

•~Si 

^■•^ 

^— ^ 

^■^^ 

^-N 

^~»» 

^■^ 

*r^ 

1    n) 

CO 

<0 

CO 

i     CO 

lO 

CO 

CO 

CO 

60 

bO 

bO 

bo 

bO 

bO 

bO 

bO 

bO 

1  '— ' 

■ — ' 

— ' 

>-' 

1  ■ — ' 

^-' 

■ — ■ 

•>— ' 

-115- 


■LO        <v^        tv_ 

CO         On     OJ 

On      CO       vO 
1A       O       <5 

On      |r^     rH       C>- 

CO       CO    xr\     OO 

CM 
C\J 

§ 

c»-\      l-rv       CO 

OJ      |VA     ro     CO 

o 

3^ 

t>-     m      o 

_cr        CO      CO      "UA 

On 

<D 

K 

O       -a        <A 

On      \\r\     On          . 

^ 

•             •             • 

•            • 

>        fO 

On 

cU 

CJ        rH        m 

CA     |f»>    <^     m 

CO 

t 

'      1  •       • 

1 

<^     -=r      r-- 

VO         C^    NO 

CM 

f«- 

^ 

Z>     ^      '^ 

t^      IMD     r—      NO 

ro 

+> 

QO        U\        rH 

-=I            NO        0\               • 

• 

■iS) 

^O 

CN        m       CO 

•           •            • 

OJ       m       r<-\ 

CN      _:3       r^ 

-^      IrH     r- 

•            •        • 

C^      |On       1 

o 

CO 

m 

C 
o 

•H 

• 

<^        CO         CO 

r-^       1    • 

o 

rH 

-P 

ro 

1        1        1 

«        , 

rH 

On 

c 

vO        On       vO 

r-^         -=i     r-\ 

^ 

a 

\0       vO       vO 

f^      frn    ro 

o 

■p 

iH        CO         rH 

rH        OO     NO 

• 

rH 

01 

CQ 

rn       CO        O 
•          •          • 

■"^      lO     OO 
•            •        • 

On 
, — 1 

O 

o 

p^     vr\      o 

-^       lOJ      r-\ 

_^ 

o 

O        CO        _3 

PA        -ZJ     U-\ 

•t 

sC^ 

•o 

1 

CO         OJ        NO 

1 
<^       j-3    -3 

t^ 

rH 

•D 

CM 

O       r^       NO 

O          CM      C^ 

Q 

C 

CSJ 

oo       r^       O 

On       lUA       -; 

•• 

n) 

»^^ 

QD        OO         rH 

'7Z. 

^ 

-=r        O         On 

fH      t-~      r-- 

oo      (rr^    sO 

1 

•> — ' 

•          •          • 

•     1    •      • 

C>.        CTn    o 

>H 

^ 

nO        CO         CJ 

•H 

1            I         rH 

r-i        \r-\      f^ 

B 

^~^ 

NO      CO       r»-\ 

•o     !o 

cd 

C9 

(H 

-^       O        m 

nO           cm 

bO 

CO 

O        CO          f"^ 

NO      |nO 

a 

^-^ 

»'~^ 

On      oD        m 

x-\          t>~ 

rH 

<?„ 

nO      -a      CO 

On       ICN 
r^          CM 

1 

•> 

*bO 

CO        i-H        o 

o 

-H 

^*-^ 

•        •         • 

•       1     • 

CO             NO 

^ 

at 

r-     no      c^ 

;3 

a> 

1             i          r-{ 

iH       ICM 

CO 

-p 

CJ 

nO       r^      NO 
O       O       nS 

O 

<1> 

iH 

NO        On       nO 

On 

X 

^ 

r w 

rH        NO         rH 

nO 

-P 

o 

<ff. 

On        m        On 

ro 

V^ 

bO 

O        On       .3 

ro 

c 

^--^ 

•          •          • 

« 

•H 

n 

-::       -Cf        rH 

-^ 

C 

1            1          CJ 

oo 

T3 

o 

0) 

•H 

r^      NO       p^ 

•H 

■p 

iH 

<^      O        n^ 

fc 

§ 

fH 

<^      O       r^ 

^-» 

«^      rH        r^A 

«3 

o* 

<? 

CO        On       CO 

O 

o 

<tjO 

oo       O       CO 

v.-*' 

•          •          • 

<U 

TD 

O       -::l       C^ 

;h 

(S 

1             1          CO 

OJ 

O 

:2 

S 

-3      c^ 

• 

•a 

CO      r- 

CO  NO 

<i> 

1^     -zr 

r-\ 

K 

OJ 

-cr     XA 

a  "O 

<« 

rn      NO 

E   C 

t-<         CM 

•H    CO 

• 

•           • 

O 

r«- 

CO         LTv 

flJUN 

CM        .3 

TD 

0) 

n 

t-^ 

o 

rH    <D 

^ 

CM 

«J  r-K 

^ 

-Q 

C  ^ 

E-i 

t>- 

O    CO 

rH 

On 

^e-^ 

<nJ 

-zt 

-p 

• 

•H    C 

NO 

-O  -ri 

<n 

•o 

(0    C 

*•              ]     ••              M 

•  • 

•  • 

OJ 

r-{        C^         rA 

C\J           rH     CM 

o    > 

«>          «S            rH        rH    1     OJ     CM 

^ 

Xl 

&X, 

OJ 

s 

00 

1  0) 

^ 

-116- 


^muke^narxiivm^^^ 


T'liiiiPuiiM  M!ifi;j*5^iiw 


m^<>^i«    rtnlw        t?«»nrftf?LlC.H3n    Ol     I'.hJt    It'-im    A.1nv    r:>-»no  tifc«»«    /^/Nnv/rtn>>»    lo  f  r  in  n  o  m  *n  I 


.(*0015»65')(3^01) 

•  2it.5736 

y"     "    18.09-  (2)(-.006ll)-.(6)(-.0633^)-(2)(.0131it)-(6)(.09691) 
12. 

-(-.00815U)  (lli57)-(  .001969)  (3i436) 

«  22.9895 


yii     -  lii.9909 
13. 

Yy     -  23.8369 
21. 

y»»     -  22.5260 
22 


y»»     »  19.6223 
31. 

Y' •     -  25.l6ii3 
32. 

Y»»     »  19.9128 
33. 


Y'«     =25.33U8 
3U 


< 
^ 


The  ad;5^i5ted  totals  for  lines  are 

y*     •  2li.5736  +  22.9895  +  ili.9909  -  62.551iO 
1*. 

Y»»     -  li6.3629 
2.. 

ytt     »  90.03l]2 
3.. 

Step  7.     Computation  of  sums  of  squares  and  variance  component  estimates. 

The  adjusted  sums  of  squares  for  line   (gO  and  sires  within  lines   (S':G') 
may  now  be  computed  as  follows: 

^  21  15  29  65 

-  609.1571  -  608.9li68 

-  .2106 

S.  Sqa.-S.  :G.  -  i2h,p^,ill:m)l  .  .  .  iSl^^MI!  .  609.1S7U 

0  0  c3 

-  610.2110  -  609.157U 

-  1.0536 

The  analysis  of  variance  for  lines  (G' )  and  sires  within  lines  (S'  :G' )  is  set 
up  as  follows: 

.117- 


t. 


I 


d.f«  S,Sqs.   M.S. 
2    .2106   .1053 


^12  ^e 
^10  oi 


E(MS) 


^22  ^:g 


S»:G    6   1.0536   .1756 
Error   W  .0522        of 

The  coefficients  for  of.g  and  Og  om  the  E(MS)  are  computed  as  follows: 

(8)2+(8)2-H(5)=   (8)2+(7)2   (6)2+(8)2*(7)2+(8)2 


kii  =  ^(65- 


21 


2$ 


29 


) 


i(65.-  22.1639) 
6  - 


~(U2.836l) 


-  7.139 
k„  -  i  (22.1639  -  (a)'^(8)-^(5)-  ^  ^  ^  (8)'  ^ 


--i- (22.1639  -  7.3692) 

-  ^(Ili.79h7) 

-  7.397 

„  1  f.^      (21)^-H(15)^-H(29)^  . 


^U4 


-  ^(6$  -  23.3185) 
-^(la.6835) 

-  20.81a 

The  coefficients  for  o^,  k_  and  k-«,  are  computed  fjrom  the  segment  of  the 

matrix  inverse  to  the  variance-covariance  matrix  corresponding  to  the  effects 
which  remained  after  absorption  and  the  P.  .  matrices,  i.e.. 


k^  .  1  .  1  |ec«(p|^  -  p.  j)j  . 
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The  P*  *  matrix  was  computed  dui-ing  the  absorption  of  the  cc+g^+s^j  equations  and 

is  given  in  Table  ^,  The  P'  matrix  is  given  in  Table  8  and  the  P  matrix  is 
given  in  Table  9.  These  matrices  ivere  computed  in  the  manner  previously  de- 
scribed from  certain  coefficients  in  the  original  set  of  least-squares  equaticrs 
(Table  1),  The  P''-P'  matrix,  after  completing  the  appropriate  subtractions  by 
rovs  and  columns,  is  given  in  Table  10,  The  reduced  P'-P  matrix  is  given  in 
Table  11. 

The  coefficients,  k-,Q  and  k-jp,  are  now  computed  as  follows: 

k^Q  -  1  +  |-  r(.0601lU)(2.6256)+(-.03ii39U)(.6l91)  +  (.010960)(-.ii303) 

+  +  +  (,00001503)  (7376)] 

-  1+  i  (.7188) 
6 


^12 


«=  1.120 
=  1  + 


J   ^.060111i)(.26l6)+(-.03ii39ii)(.l696)  +  (.010960) (1.3137) 
(.00001503)  (10,!i85)] 


+  +  + 


=  1  +  |-  (1.1379) 

»  1.569 

The  computed  mean  squares  may  now  be  set  eqiial  to  their  expectations  and 
the  variance  component  estimates  computed. 


d.f. 


G' 

2 

S»  jG» 

6 

Error 

U8 

Mean  Squares 

.1053  =  1.569  Si  +  7.397  ^^^  ^   20.8U1  3| 


^3:g 


g 


.1756  =  1.120  Si  +  7.139  Si:g 


oi: 


.0522  =  of 

.1756  -  (1.120)(.0522) 


:g 


S| 


.016U 


7.139 

.1053  -  (1.569) (.0522)  -  (7.397)(.Ol61i) 

20.8ia 


=  -.00li7 


I 


i 


The  estimate  of  .Ol61|  for  o|.g  is  essentially  the  same  as  that  obtained 
from  the  complete  least-squares  analysis  where  C7|jg  was  estimated  to  be  .0166. 
The  descrepance  is  due  to  rounding  errors. 
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Table  10.  The  reduced  P»i-P«  matrix. 


a^     a^    (g^^ii  (ga)i2   (ga)^-,^  (ga) 


22 


ai:     2.bZbt 

a^: 

(ga)-Q' 

(ga)-j^2** 

(ga)2-LS 

(ga)^^= 

b: 

d: 

2.62^6  .615'1  -*h303 

>8270   .38^5 

2.5063 


.3855 

-1.3ii92 

-.0612 

-20.67 

19  .lU 

-.3177 

-.0812 

-.5655 

-23.25 

11.88 

•5953 

l.li683 

.10l;9 

-17.36 

-2.82 

.8223 

.101;^ 

.5700 

12.87 

1.7U 

1.587U 

.1286 

-li;.60 

-33.23 

.57ii5 

llt.81 
15-67 

-9.52 
1002 
7376 

Table  11.  The  reduced  P«-P  matrix. 


^1 

4V 

^sa)ii 

(£3.)j2 

(ga)2i 

(g'a)22 

b 

d 

^1- 

.2616 

.1696 

1.3137 

1.0908 

.11+15 

.1530 

-U.ii8 

-iiS.OO 

^' 

.1229 

.7060 

.5777 

-.15U8 

-.1109 

-10.146 

-35.76 

(ga)^^- 

8.2102 

6.9129 

y.hhQl 

3.1033 

61.37 

-189.36 

(ga)22- 

5.8255 

3.026U 

2.7163 

$S.96 

-l5U.lii 

(ga)2i: 

U.7301 

U.0521 

3ii0.28 

62.17 

(ga)22* 

3.U75U 

119.09 

1;7.09 

bJ 

■-: 

iai53 

3528 

d: 

10,Ii85 
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MODELS  FOR  THE  AMLYSIS  OF  DATA  FROM  CROSSBREEDING  EXPERII^NTS 

The  general  least-squares  procedures  for  the  analysis  of  data  from  single 
crosses  only  were  developed  by  Kenderson.ll/  The  designs  of  many  crossbreeding, 
or  line  crossing  experiments  result  in  data  imich  may  be  analyzed  in  several 
ways.  Three  methods  of  analysis  will  be  considered  in  this  section:  first, 
the  appropriate  analysis  for  data  obtained  when  all  possible  matings  are  made 
between  lines  but  no  linebreds  are  produced;  second,  an  analysis  of  all  data 
from  linebreds  and  linecrosses  produced  together  wherein  the  linebred  effects 
are  estimated  independently  of  the  general  combining  ability  and  maternal 
ability  effects;  and  third,  an  analysis  of  all  data  from  linebreds  and  line- 
crosses  produced  together  wherein  the  linebred  effects  are  estimated  simul- 
taneously with  the  general  combining  ability  and  maternal  ability  effects. 

Analysis  of  Linecross  Data 

The  analysis  of  data  in  which  linebreds  are  not  included  will  be  con- 
sidered first. 


Model 

The  uiiderlying  mathematical  model  for  the  analysis  when  linebreds  are  not 
included  is: 

yi;Jk  -  M.  +  gi  +  gj  +  mj  +  Cij  +  r^j  +  ei^j,  ....   (7) 

where  y^-j^  is  the  k"''"  observation  on  the  progeny  from  the  i"^^  line  of  sire  and 

the  j"^^  line  of  daia,  ji  is  the  over-all  mean  when  equal  subclass  numbers  exist, 
gj^(gj)  is  the  general  combining  ability  effect  for  the  i^^  (j^^)  line,  m^  is 

the  maternal  effect  for  the  j^^   line  of  dam,  Cj_-j  is  the  specific  combining 

ability  effect,  r^^^  is  the  sex-linkage  or  reciprocal  effect  and  e±^i^   is  the 

random  error,  assumed  to  be  NID(0,Oq),  The  gj^Cg^)  are  assumed  to  be  one-half 

the  additive  genetic  value  (breeding  value)  of  the  i  ( j^)  iine,  and  the  value 
is  expressed  as  a  deviation  from  [j,.  The  m^  measure  the  pre-natal  and  post- 
nasal mothering  ability  of  a  line,  and  each  is  a  fvmction  of  the  genotype  of  a 
line  rather  than  of  the  genes  transmitted  to  the  female  progeny  of  the  line. 

The  Cij  are  effects  in  addition  to  the  gj_  and  ra^  effects,  and  are  measured 

over  the  progeny  of  the  i'th  line  of  sire  with  the  j"t^  line  of  dam  and  the  pro- 
geny of  the  j'th  line  of  sire  with  the  i't-h  line  of  dam.  The  c^j  measure  how 

much  better  or  poorer  than  average  are  the  means  of  reciprocal  matings  than 
would  be  expected  on  the  basis  of  exact  knowledge  of  the  additive  genetic  val- 
ues and  maternal  values  of  the  lines.  The  r^^-t  are  effects  in  addition  to  the 
additive  genetic,  maternal,  and  specific  effects  and  are  measures  of  the  diff- 
erences between  reciDrocal  crosses  after  account  has  been  taken  of  the 


13/  Henderson,  C.  R,  19 h9.  Ibid. 
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differencas  in  maternal  ability  between  the  1^^   line  and  the  j"^^  line.  For  a 
more  detailed  discussion  of  these  effects,  see  Henderson's  thesis. 


ft 


below: 


least-Squares  Equations 
The  least-squares  equations  for  model  (7)   are  presented  in  tabular  form 


M- 


Sh 


m 


J 


cij 


rij 


RHM 


Si 


n,, 


ni.+n.i  n,-j         nij+nji         n^j 


^r^ji 


"i. 


+n.. 


n^^+n.i 


^j         ^r"Ji         "ij 


I.. 


^i.^Y^i 


"ij'^^ji 


n 


•J 


^d 


O-io^ 


^i;3 


n 


ij 


^J-'^di  °ij'"^ji 


0 


nij       Qnij+nji-       n±^ 


^iJ 


^ij 


^3 


iii 


ij     o'^iJ 


Y. 


'i^Yji 


^id 


These  equations  differ  somewhat  from  the  least-squares  equations  presented 
previously,  in  that  the  off -diagonal  elements  in  the  segment  of  the  coefficient 
matrix  concerned  with  general  combining  ability  effects  are  not  zero.  This  is 
a  result  of  the  fact  that  each  crossbred  group  is  used  in  estimating  the  gen- 
eral combining  ability  effect  of  two  lines.  For  example,  a  cross  between  a 
sire  of  line  1  and  a  dam  of  line  2  is  used  in  estimating  the  general  combining 
ability  of  both  lin^l  and  2,  Thus,  the  sum  of  the  coefficients  for  the  g^  in 

the  (J,  equation  is  equal  to  twice  the  coefficient  for  |j..  However,  the  sums  for 
the  coefficients  for  the  mj,  ^±is   s^^i  rij  effects  in  the  \i  equation  are  equal 

to  one  another  and  also  equal  to  the  coefficient  for  \i»     In  addition,  the 
totals  of  the  RHI-I's  for  all  equations  other  than  the  g-j_   equal  the  grand  total 

Y,,  ,  The  total  of  the  PliM's  for  the  gj_  equations  is  equal  to  2Y,,  ,  Before 

the  coefficient  matrix  can  be  inverted  to  obtain  the  inverse  elements  for  use 
in  making  the  appropriate  tests  of  significance,  certain  restrictions  must  be 
imposed  on  the  constant  estimates. 

Imposing  Restrictions  on  the  Constant  Estimates 

Convenient  restrictions  that  can  be  imposed  on  the  constant  estimates  are 

?ii  -  ^^   -  zSij  -  Z^ij  »  Tzcij  -  2?ij  -  Zrij  -  rij  +  ?ji  -  0 

The  restrictions  that  2gj_  -  Zraj  =  0  may  be  imposed  by  completing  subtractions 
by  rows  and  columns  as  explained  previously.  The  subtractions  and  additions 
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required  to  impose  the  restrictions  on  the  estimates  for  the  specific  effects 

p-1^ 
(Zc^  4  =  Zcj^  =  Z  Zc^  4=0)  will  be  given  for  the  specific  design  of  all  possibOe 

crosses  among  four  lines. 

If  the  numbers  in  the  subclasses  concerned  with  the  measurement  of  spe- 
cific effects  are  represented  by: 

^12  ^^13  "Hi 

°21  .  "^3  "^h 

""31   ""32  '^31; 

"^la      ^li2  \3 

then  subtraction  of  the  last  column  in  each  row  from  all  other  coefficients 
results  in: 

"l2""lii  "l3'''lU  ''lii"''lli 

^l'°2U  "23"'^U  "^ir^h 

''3l'""3U    ''32"^3U  ''3li"''3i; 

'    ^la'%3    ''U2'^3  ^3^3 

Subtraction  by  row  then  3n.9lds: 

"12""lli"°!;2'^"U3    "I3~"lli 

°21-"2i;-"ia''"U3  "23"*"2U 

''3l'^?k'''la"%3  ''32''''3ir%2^%3 

Since  the  specific  effect  for  a  particular  cross  is  measured  over  the  two 
reciprocal  crosses,  these  values  must  be  combined  as  follows: 

'il2-^lIi-nU2"'°U3'''^21'""2U""la''"U3    "l3~"lU"'''31""3U~"ia''"li3 

^3"'^li''''32"''3r''U2''^l43 
or    n^.n^-nt^.2n^3   ^l3-"li;-^3U*^3 

43"^2ii-^3U""U3 
where  n,^^  "  ^*^l*  "%   "  ^*V  ^^^• 
The  third  restriction,  Z  Zc^j  "  0,  results  in  the  following  equations: 
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°i2-°il,'°2li*2^3'°23*"21;*"3U""U3  °  "i2""iii"°23*"3l.  •  •  '  '  '■^'> 
and   °l3-nli,-°3i.'^3'"23'"21.*''3!.A3  '  "is'Vb'^Zl, ^^^ 


The 


A 


restriction  on  the  equations  for  the  reciprocal  crosses  that  Z^±^   + 


2r ..  ■  0  results  in  subtraction  by  row  and  column  by  the  same  procedure  as 

followed  for  the  specific  effects.  The  restriction  that  the  reciprocal  effects 
for  each  cross  must  sum  to  zero  (r.  .  ■•■  ^-?j_  ~  0)  yields  the  following  equations: 

^12-^lli"^U2"'^l;3"^2l'"^^U'"V'U3  '  ''l2-^lU"V""2l'''^ii''V  '   '   ^^^^ 

^3""^"^31*^3U"V"^U3       '  ^3'^"^31*^3U"VA3  '  '   ^^^ 

^23"^2r^32*W-^3        "  "23"^2l;'^32"V"l^A3  '  *  ^^^ 

The  subtractions  and  additions  outlined  in  equations  8-12  are  those  that  are 
completed  for  the  specific  combining  ability  effects  and  the  sex-linked  effects 
in  the  \jl  equation.  The  subtraction  and  additions  among  the  coefficients  for 
these  effects  in  all  other  least-squai^es  equations  follow  the  same  form  as 
given  in  the  above  equations.  The  procedures  outlined  in  these  equations  are 
also  completed  in  the  RH4's  for  the  specific  and  sex-linked  effects,  using  the 
appropriate  sums  in  order  to  obtain  the  reduced  RHM's. 

The  equations  for  the  subtractions  required  to  impose  these  restrictions 
on  the  specific  and  sex-linked  effects  will  differ  depending  on  the  number  of 
lines  involved,  Hoi-rever,  the  procedures  shown  above  can  be  utilized  to  de- 
termine the  appropriate  equations.  After  the  coefficient  matrix  has  been 
reduced  there  will,  remain  a  symmetric  matrix  of  order  p(p-l),  where  p  »  the 
number  of  lines  included  in  the  study. 

Inversion  of  the  Reduced  Matrix 

The  size  of  the  resulting  least-squares  matrix  is  normally  so  large  as  to 
make  its  inversion  by  means  of  a  desk  calculator  extremely  laborious  if  not 
prohJ-bitive •  Programs  are  readily  available  for  the  inversion  of  matrices  by 
high  speed  computing  equipment,  l^.ere  the  size  of  the  matrix  dictates  such 
action,  partitioning  of  the  ma.trix  may  be  required. 

Computing  the  Least-Square  Means  and  Standard  Errors 

The  equations  may  be  solved  directly,  at  the  same  time  as  the  inversion  of 
the  matrix,  iji  order  to  obtain  estimates  of  the  constants  |a,  gj_,  m^,  c^^,   and 

rj_j.  However,  it  is  sometimes  more  convenient  to  obtain  the  constants  from  the 

inverse  elements  and  the  RHM»3  of  the  equations  be  means  of  the  formula  pre- 
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Tioiisly  presented, 

ZG^JY^  -  Ci 

where  C^ J  is  the  inverse  element  for  the  i'*'^  row  and  j"^^  column  of  the  inverse  " 
matrix,  Yj  is  the  RHM  for  the  j'^   row  and  Cj_  is  the  estimate  of  the  i'^h  con- 
stant. The  constant  estimates  for  the  model  under  discussion  will  include  the 
least-squares  mean  and  the  estimates  of  the  general  combining  ability  effects, 
the  maternal  effects,  the  specific  combining  ability  effects  and  the  reciprocal 
effects  expressed  as  deviations  from  the  least-squares  mean. 

In  order  to  estimate  the  standard  errors  of  the  mean  and  other  constant 
estimates  it  is  necessary  to  estimate  the  error  mean  square,  of,  which  is  given 

by 

^  °  n..-p(p>l)    l]l  ^hk   -  R(l^,g,m,c,r) 

•fcrtiere     7— tt  "  degrees  of  freedom  for  error, 

n,.-p(p-l)     ^  ' 


z 

22Z  y±Ai.   =  total  uncorrected  sum  of  squares, 
ijk   -J 


and  R(n,g,m,c,r)  =  reduction  in  sum  of  squares  due  to  fitting  all  constants. 

The  reduction  due  to  fitting  all  constants,  R(ix,g,m,c,r)  can  be  computed 
from  all  constant  estimates  and  the  original  set  of  RHM's  as  follows: 

R(ti,g,m,c,r)  =  tiY..  +  g-j_  (Y-^^  +  Y.^)  +  .  .  .  .  +rpp  Y^^. 

The  reduction  due  to  fitting  all  constants  can  more  easily  be  calculated  from 
the  constant  estimates  obtained  from  the  solution  to  the  reduced  matrix  to- 
gether with  the  reduced  PlHM's, 

The  standard  error  of  the  mean  is  obtained  from 

where  CI-'+'-  is  the  diagonal  inverse  element  corresponding  to  the  n  constant.  In 
order  to  estimate  the  standard  errors  for  all  the  remaining  constants  it  is 
necessary  to  complete  the  inverse  matrix  for  the  equations  that  were  subtracted 
out  when  imposing  the  restrictions  on  the  constant  estimates.  Once  the  m&trix 
has  been  completed,  standard  errors  for  constant  estimates,  or  linear  functions 
of  constant  estimates,  can  be  calculated  as  explained  in  the  first  two  sections. 

Computing  Sums  of  Squares  for  the  Analysis  of  Variance 

The  sums  of  squares  for  the  gj^,  m-t,  Cj_j,  and  t^a   can  be  obtained  directly 
using  the  formula  B'Z'-'-B  where  Z"^   is  the  inverse  of  the  segment  of  the  matrix 
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inverse  to  the  variance-covariance  matrix  corresponding  to  the  particular 
effects,  and  B*  and  B  are  the  constant  estimates  for  these  effects.  The  sums 
of  squalls  may  also  be  computed  indirectly  by  differences  in  reductions  after 
obtaining  the  solutions  to  other  sets  of  equations. 

In  this  analysis   the  sums  of  squares  could  be  gotten  as  follows: 

Source  of  variation  Sums  of  squares 

General  combining  ability  R(p.,g,m,c,r)  -  R(iJ.,m,  c,r) 

Maternal  ability  R([a.,g,m,c,r)  -  R(|j.,g,c,r) 

Specific  combining  ability  R(i.t,g,m,c,r)  -  R(n,g,m,r) 

Sex-linkage  effects  R(|i.,g,m,c,r)  -  R(^,,g,m,c) 

The  sum  of  squares  for  error  can  be  computed  as 

2 


m  ^i* 


-  R(^,g,m,c,r) 


as  previously  noted.  Since  all  of  the  variability  among  the  subclasses  is 
accounted  for  by  the  constants  being  fitted,  the  total  reduction  in  sum  of 
squares  can  be  obtained  by  calculating 
Y?. 

2Z  -—  =  reduction  due  to  fitting  all  constants, 

ij  "10 

The  degrees  of  freedom  for  the  model  described  are  calculated  as  shown 
below: 

Source  of  Variation        Degrees  of  Freedom 

Total  n..  -1 

General  combining  ability       p-1 
Maternal  ability  p-1 

Specific  combining  ability      p(p'"3) 

2 
Sex- linkage  effects  p(p-3)  +1 

Error  n,,-p(p-l) 

Thus  it  can  be  seen  that  with  only  thiree  lines  included  in  the  study, 
there  would  be  no  degrees  of  freedom  available  for  the  estimation  cf  specific 
combining  abilities,  and  only  one  degree  of  freedom  available  for  the  esti- 
mation of  sex- linkage  effects. 

Upon  completion  of  the  analysis  of  variance,  it  can  be  determined  which 
effects  are  significant  and  should  be  retained.  If  some  of  the  effects  do  not 
appear  significant,  the  equations  for  these  effects  can  be  deleted  from  the 
model  and  more  efficient  estimates  of  the  remaining  effects  obtained.  The  in- 
verse of  the  variance-covariance  matrix  v/hen  one  or  more  sets  of  effects  have 
been  eliminated,  C7I,  is  given  by 

C7I  «  C-1  -  RZ-lR« 
A     R 
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where  Z  is  the  segment  of  the  matra_x  inverse  to  the  variance-covariance  matrix 
corresponding  by  row  and  column  to  the  effects  being  eliminated,  R(R')  is  the 
segment  of  the  matrix  inverse  to  the  vai'iance-covariance  matrix  associating 
these  effects  vrLth  the  effects  to  be  retained,  and  C^-^  is  the  segnent  corres- 
ponding by  row  and  column  to  the  effects  being  retained. 

The  new  constant  estimates  of  the  effects  to  be  retained,  Bj^,  can  be 
obtained  by  means  of  the  formula 

B^  °  B2  -  RZ'"^B2_, 

where  Bo  represents  the  constant  estimates  for  the  effects  to  be  retained, 

before  adjiistment,  and  B-i  represents  the  constant  estimates  of  the  effects 

being  eliminated.  The  definitions  of  R  and  2  are  given  above.  The  new  con- 
stants can  also  be  obtained  by  omitting  the  RHM's  of  the  effects  being  elim- 
inated and  multiplying  the  HHll's  by  each  row  of  the  new  matrix  inverse.  The 
elements  of  the  new  matrix  inverse  and  the  new  constant  estimates  are  those 
which  would  have  been  obtained  had  the  effects  which  were  eliminated  not  been 
included  in  the  original  model. 

Estimation  of  Variance  Components 

If  all  effects  are  assumed  to  exist  and  the  lines  are  regarded  as  random, 
the  expectations  of  the  mean  squares  are  as  shovm  below: 

Source  of  Variation  E(MS) 

General  combining  ability     of  +  k^c^  +  IC70Q  +  kgOg 

Maternal  ability  o|  +  kro^  +  ^^"^ 

Specific  combining  ability  o|  +  k2<^  +  ^3*^ 

Sex-linkage  effects  of  +  k-,0^ 

Error  of 

The  k^,  ko,  k^,  and  kg  values  can  be  calculated  by  the  direct  procedure 

.  explained  in  the  section  discussing  the  two-way  classification  without  inter- 
action. The  k2,  k^,  k^,  and  kj   values  can  be  calcxilated  by  the  indirect  pro- 
cedure explained  in  the  same  section.  The  variance  component  estimates  are 
computed  by  setting  the  above  expectations  equal  to  the  computed  mean  squares 
and  solving  the  resulting  equations. 

Numerical  Example 

The  computational  procedures  outlined  above  for  setting  up  the  least- 
squares  equations,  imposing  restrictions  on  the  equations,  obtaining  sums  of 
squares  and  degrees  of  freedom  to  complete  the  analysis  of  variance  and 
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estimating  the  components 
cross  data  involving  four 
presented  in  Table  12. 


of  variance  are  illustrated  -with  a  set  of  single 
lines.  The  subclass  numbers,  totals,  and  means  are 


Table  12.  Subclass  nujnbers.,  totals^  and  msans 


T.ine  of 
Sire 

i,ine 

of  Dam 

Sums 

1 

2 

3 

h 

1 

Number 

Total 

Mean 

22.. 
1231.0 

12 

767.2 

63.933 

10 
621.0 
62.100 

12 
787.5 
65.625 

^6 
3ii06.7 

2 

Number 

Total 

Mean 

12 
712.0 
59.333 

26 
1725.0 
66.3h6 

10 

597.lt 

59.7i;0 

10 

610.5 
61.050 

58 
36hh.9 

3 

Number 

Total 

Mean 

12 
771.3 
6U,275 

12 

773.9 

6i;.U92 

12 

61i5.5 
53.792 

12 

799.9 
66,65Q 

li8 
2990.6 

h 

Number 

Total 

Mean 

111 

860.8 

6l.ii86 

8 
U59.2 
57.Ii.00 

12 

60^.1i 
5o.li5o 

12 
731.6 

^.967 

U6 

2657.0 

Sm 

Number 
■^       Tot^T 

60 
3575.1 

58 
3725.3 

hh 

2ii69.3 

li6 
2929.5 

208 
12699.2 

These  dataware  the  28-day  weights  of  rats  resulting  from  an  experiment  by 
Kidwell,  et  al.iid/  The  observations  are  the  sex  by  litter  means  and  only  those 
litters  with  rats  of  both  sexes  are  included  in  the  analysis.  Therefore,  the 
sex  effects  are  orthogonal  to  the  subclass  differences.  The  least-squares  co- 
efficient matrix,  using  the  model  previously  presented  in  this  section,  is 
shown  in  Table  13.  It  can  be  observed  that  the  segment  corresponding  to  the 
rows  and  columns  to  the  general  combining  ability  is  filled  in  completely, 
whereas  the  segments  corresponding  by  row  and  column  to  the  other  effects  have 
entries  only  on  the  diagonals.  In  addition,  the  RHM's  for  the  general  combin- 
ing ability  do  not  equal  the  sum  for  the  \i  equation. 

The  least-squares  coefficient  matrix  iar  then  reduced  by  imposing  the 
restrictions  previously  presented  and  this  reduced  matrix  is  shown  in  Table  iJi. 

If  the  equations  are  solved  at  the  same  time  the  matrix  is  inverted,  by 
including  the  RHM's  in  the  equations,  then  the  constant  estimates  are  obtained 
directly.  If  the  solution  to  the  equations  is  not  computed  directly,  the  con- 
stants can  be  calculated  from  the  inverse  elements  and  the  RHM's  of  the 
equations,  since      ZC''"*^Y 

as  noted  previously.  ' 

Hi/  Kidwell,  J.  F.,  H.  J.  Weeth,  W.  R.  Harvey,  L.  H.  Haverland,  C.  E. 
Shelby,  and  R.  T.  Clark.  Heterosis  in  crosses  of  inbred  lines  of  rats. 
Genetics  1^5:225-231.  I960. 


•j' 


-130- 


■  J,  I  Mii.iJi;t|i,tii)ij;i 


^1^   II ^t 


j*^  ^  «%  rt  41  frit  */m    O  <-^ «t ««  •- 1 #-*  h  ^    if>f  rirt  n  Am  iQ  n  t 


0 

-P 

U 
0) 

c 


1-1  ^ 


CO    0) 


CO  ^ 

o  U 

•H 

f^  + 

e  T^ 

9  -H 

©  + 

u 

0  + 

01  tlO 

c 

o  + 

•H 

■P  ^ 

oj  bo 

or  + 

0) 

n 

(U  R 

0}  ^ 

cr  -H 

CO  >j 

I 

-P  rH 

n  Q) 

CD  T3 


r^ 


J3 
CO 


« 

-3 

CVJ 

-3 

CVJ 

CM 

CVJ 

rH 
CVJ 

-3 
rH 

<»4 

CVJ 

-3 
OS 

-3 
CVJ 

CVJ 

<o 

-3 
iH 

<o 

rs 

CVJ 

iH 

<# 

<r 

CVJ 

<^ 

-3 

<bjQ 

CVJ 

<bo 

<Go 

<a. 

eHOOCVjOvmiH    rr^oOOvCVI<n<*^c^r^<^CVJ     O'Lf\O-3'*^<^0N0\00CVJ_3 


vO    Ov   O   OO  P^ 

vO     pH    CVJ    vO  CVJ 

r^   lA   CK    rH  iH 

CO    -3    rr\  -3  -3 


CVJ   CVJ 


C^C^CJNCVJOD     rH     CJS'Lrvf^rH     ^-CV)r--0 
CVJCAr—    ON-3     (^vO     OvO     CMCOrHCrVrH 

<^OOOrH_3mM3     f»^0-3C^\0     C^C^t^O 

CVJCVJrHCVJrHi-HrHrHrHrH 


8 


CVJ 
rH 


CM 


rH     r^    On    O    CN   lA 
£v-    (^    r--  MD  1-A    O 

r--  r-  r—  CO  -3  vo 


cvji 


CO 


CO 


CO 


CO 


GO 


00| 


-3   -3 


_3  -3 


.St 
t-t 


CVJ    CVJ 


CVJ 

iH 


CM 


CVJ  CJ    CVJ 

r-{  H    rH 


CVJ     CVJ  CVJ  CVJ 


CVJ 


CVJ 


CM 
iH 


al 


o 


o  o 


o 

H 


CVJ    CVJ  CVJ 


CVJ   CVJ 


O    O 

rH     rH 


CVJ   CVJ  CVJ 


CVJ 
rH 


CM 

H 


^1 


CM 


CM 


CM 


CM 


CM 


-3 

CVJ 


CO 


CVJ 
CM 


CM    CM 

CO  CO 


CM    CM 
CM    CM 


CM    CM 


^1 


CM 


CM 


CO 


CM     O 


CO 


CM 
CM 


CM     CM 


CVJ     CM 
CM     CVJ 


CM 


CM 


vol 
CM 


CM 
CM 


CM 
CM 


CM 


.-3-3-3 

CM  CM    CM 

_3  CM    O 

<^  rH    rH 

CM  O    O 

<^  rH    rH 

CM  CVJ    CM 

rr\  r-i    fry 

CO  CO    CVJ 

cr\  f^   i-i 

OD  ^O    CO 

^0  CM    rH 

CO  CVJ    CVJ 

O  CM    CVJ 

-3-3-31 

sO  CM    0\ 


CM     CM 


^1 


CM    -3 


CM    CM 


CM   CO 


CM    -3 


T^l 


o 
i-i 


CM 


CM 


CO 


CM 
fH 

O  O 

rH  r-i 

CM  CM    CO 

iH  rH 

CM    CM   -3 


CVJ 

rH 

CM 


CJ 


CVJ 
rH 


CM 


CM 


^ 


vOICM    -3 
r^lr-  vO 


-3  CO 
CVJ    vO 

0O|_3 
vOjCM 

CM    CO 
I  CM    t-H 

CM    vO 
CM    CM 

CO    CO 
NO  vO 


-3    CO    CM 


-3 
rr\ 


^O 

CM 


CO 


CM     CM 


CM     CM 

r-H    r^ 


CO     CM 
r^    rH 


C\i    CM 

OS   rH 


O 

rH 


CM 
CM 


CM    CO 
CM     rH 


O     CM 


CO     CM 
OS    as 


CM    -3 
OS    OS 


CM 
CM 

-3 
CM 

-3    CM    vO 
CM    CM     CM 


-3    CM    sO     CM    CO 
CM    CM    CM     CM     rH 


CM 


-3 
CM 


CM 

rH 

CM 


CM 


o 

rH 
CM    O 


CM 


CM 
rH 

CM    CM 

rH    rH 

CM 


O     CM 
rH     rH 


CM 

rH 


-3 
CM 


CM     O     CM    CM     O 


CM 


O     CM 

rH     rH 


CM    CM 
rH    rH 


CM 

rH 

CO 

-3 
rH 

-3  CO     CM 

rH  rH 

CM 

rH 

OO 

-3 
r-i 

-3  OO    CM 


^-HCM^O-3c^C'C^r?f<^<*^-^     r^   -^   ~=i    «V     rrv,^rH     fn-3rH     CVJ_3 
hO«iOt)iOt>OE      EEErHrHrHCMCMrr^fHj-HrH<^^<^'^f^'^ 


rH    CM     or\ 

-3-3-3 
f-,    U   .=1 


-131- 


I 'fBWpW!jgpitoa.J|!j^^ 


Table  m»  Reduced  least-squares  equations  and  RHM's, 
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4, 


The  inverse  elements  for  the  above  matrix  are  presented  in  Table  l5,  together 

with  the  constant  estimates.  The  total  reduction  in  sum  of  squares  due  to  fit-     t 

ting  all  constants  can  be  obtained  by  multiplying  the  colnDin  vector  of  all  | 

EHM's  by  the  row  vector  of  all  constant  estimates.  The  same  result  is  obtained 

by  using  the  constants  estimated  from  the  reduced  matrix  and  the  corresponding 

reduced  RHM's.  For  exan^Dle, 

R(^i,g,m,c,r)  '   (6l.378)(8366.1)+(2.9ia)(396.5)+ +(1.229) (-133.3)=  517,127 

#- 
This  reduction  in  sum  of  squares  can  also  be  obtained  from  the  subclass        ^ 

numbers  and  totals  given  in  Table  12,  as  §• 

,   (767.2)2   (621.0)2        (605.l4)^   , 

B.i\i,g,m,CyT)   - +  — +  .  ♦  .  •  +  •*— — ; =  517,131 

12        ID  12 

Since  only  three  significant  digits  "were  carried  in  some  of  the  constant  esti- 
mates the  two  answers  do  not  agree  exactly. 


The  next  step  is  to  calculate  the  error  sum  of  squares.  Normally  this  is 
obtained  in  a  least-squares  analysis  from 

22Z  yiik  -  R(all  constants),  .  . 

ijk   "^ 

However,  in  these  data  the  sex  and  sex  by  subclass  effect  would  remain  in  the       | 
error.  The  sum  of  squares  for  error,  therefore,  was  obtained  by  also  subtract-      |; 
ing  the  corrected  sums  of  squares  for  sex  and  sex  by  subclass  from  the  total 
uncorrected  sum  of  squares.  The  resulting  error  stmi  of  squares,  with  136-2li  » 
122  degrees  of  freedom,  was  found  to  be  5,786,3.  *  § 

Since  the  differences  among  the  r^j  were  small,  and  there  is  little  bio- 
logical basis  to  expect  these  effects  to  exist,  a  test  of  significance  was  made 
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to  determine  whether  this  variable  could  be  deleted.  The  adjusted  sum  of 
squares  for  R,  the  sex-linked  effects,  is  obtained  from   the  general  formula 
B'Z"1b  as  follows: 


S.  Sqs.— R  -  [.767  .981+  1.229]  (10-6) 


^767  .981+  1.229) 


22060     -II38I4       10286 
-ll3Bir      219l;9     -111+58 

10286   -nllFF     2301+7 


-1 


66.3559  25.6001  -16.8877 

25.6001  7I.U053     2i+.07l+2 

-16.8877  2i+.07i+2     62.8951i 


|55.3305  119.1+853  88.03i^6j 


.767 

.98U 

1.229_ 

■  .767 

.981+ 

1.229 


.767 

.981+ 

1.229 


268.21 


The  test  for  the  significance  of  differences  among  the  rj_j  is  then  complet- 
ed as  follows: 


Source  of  Variation         d.f . 

Sex-linked  effects  3 

Error  122 


Sum  of  Squares     Mean  Square       F 

1.89 


268.2 
5786.3 


89.1+ 
1+7  .U 


Since  the  sex-linked  effects  were  non-significant  they  were  deleted  from 
the  analysis  in  order  to  obtain  more  efficient  esti^nates  of  jj,,  gj^,mj,   and  s^j 

the  inverse  elements  of  Table  11+  were  adjusted  by  use  of  the  formula 

c~^  -  c^^rz-1r» 

The  R2r-%'  matrix,  which  was  subtracted  from  the  equations  in  the  segment 
corresponding  to  the  remaining  effects  is  presented  in  Table  16. 

Table  I6.     The  RZ^%'  matrix  used  to  delete  the  r^y   (xlO^) 


gl       g2         23       ^1 


m2 
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3       ^12     ^13 


li     J     28     -33     -23 


g2 

63 
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mo 

"3 
^12 

^13 


-31  8  103  20 

1+8  -91  -91  -3 

35  17  -11+9  0 

U+  -35  -130  6 

-3^  112  35  -116 

103    -91  '2h9    -130  IF"  ila  51 

20     -3      0        6  'kS  "51  50 

-28      33      23        31  -8  -103  -50 

27      28    -63      -22  -51  121  37 


-U  127  -35 

-23  ^^  101 

-31  48  "IF 

8  -91  17 


-28  27 

33  28 

23  -63 

31  -22 

-8  -51 

-103  121 

-20  37 

28  -27 

-TT  68 
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These  values  vere  subtracted  element-by-element  from  those  in  the  segment  for 
the  remaining  effects,  resulting  in  the  new  ijaverse  matrix  of  the  variance- 
covariance  matrix  with  the  rj_-j  effects  deleted.  This  matrix  is  presented  in 

Table  17. 

Table  17*     The  invei^e  matrix  (CJ^)   of  the  Tariance-cgvariance 


matrix  with  the  t^a  effects  deleted. 


(xlO^) 


A 

A 

A 

A 

•K 

A 

A 

A 

A 

i^ 

SI 

22 

53 

mi 

ra2 

m3 

C12 

^13 

M- 

:     7i;70 

-Uoi 

663 

-U53 

-603 

71 

67li 

-526 

982 

^1 

i    -UOl 

2552U 

-93U0 

-69lt9 

-16576 

5039 

5732 

iiOl 

-375 

K?    : 

t      66Q 

-^^liO 

251i9ii 

-8551 

608U 

-17559 

5729 

-668 

-61 

g3  •' 

!    -U53 

-69h9 

'BBBU 

2iill9 

5615 

1;557 

-15619 

U53 

-1317 

mi   : 

'    -603 

-16576 

6C8U 

56i5 

311i36 

-10ii52 

-nlil? 

603 

-395 

m2   i 

71 

5039 

"17539 

li557 

-10li52 

35U96 

-11509 

277 

1268 

\ 

^3  ' 

67U 

5732 

5729 

-15619 

-llhl2 

-um 

33283 

-67h 

310 

I 

o-\p: 

-526 

liOl 

-668 

ii53 

603 

277 

-6?U 

l)tln5 

-7926 

i 

C13: 

982 

-375 

61 

-1317 

-395 

1268 

310 

■-'ms 

15937 

I 

Since  the  r^-;  were  assumed  to  equal  zero,  these  PlHM*s  were  omitted  and  the 

new  constant  estimates  obtained  as  before,  by  multiplying  the  Pi24's  by  each  row 
of  the  new  inverse.  These  constants  are  shown  below: 

|1  =  10-6  ^(7^70)  (8366.1)  +  (-U01)  (396.5)  +••••  + (982)  (-557.6)}-  61.35 

gl  -  2.9U        \  '  -1.53         c^  -  -1.33 


g2  "1.3h 
g^  -  2.73 


mg  -  1.38 
m^  -  -5.89 


'13 


-.22 


The  remaining  constants  (gL^mj  >c-y  ,C2^,c«|  ,c-.  )  can  be  obtained  by  differences, 
since  gi+g2'''g3"*"g||"  mi+m2+m2+m]^=  0  and  the  c^^   sum  to  aero  by  row  and  column  and 


p-1^ 
for  22c 


ij 


i;}- 


The  reduction  in  sum  of  squares  due  to  fitting  pL,g,m,  and  c  is  th^n  calcu- 
lated by  multiplying  the  reduced  RHM's  by  the  corresponding  constant  estimates, 
as  follows 

R(ti,g,m,c)  -  (6l.35)(8366.1)+(2.9U)(396.5)+ +  (.22) (-557. 6)  -  5l6,ii72.7 

Normally,  this  reduction  in  sum  of  squares  woTild  be  subtracted  from  the 
total  uncorrected  svm.   of  squares  to  obtain  a  new  error  term.  However,  since  a 
difference  in  sexes  was  also  involved  in  this  analysis,  the  error  term  computed 
from  among  the  litters  within  the  sex  by  cross  subclasses  was  used. 
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Conrpletion  of  the  Analysis  of  Variance 


In  order  to  complete  the  analysis  of  variance,  it  is  only  necessary  to 
compute  the  sums  of  squares  for  the  general  combining  ability  effects,  maternal 
effects,  and  the  specific  combining  ability  effects.  The  sum  of  squares  for 
the  general  combining  ability  effects  is  computed  as  follows: 


S.  Sqs. G  =[2.9U     -1.3li     2.73} 


.025^21;  -.0093liO  -.0069h9 
-.0093iiO  .025U9h  -.0085511 
-.0069ii9    -.008551;      .021019 


p.9U    -1.31;    2.73*?  f^hJM  30.3li87 

^  30.3ii87  60.1i735 

[27.i;003  30.1920 

[203.9050     90.6lli8     20li.0715Q 


-1 


27.1i003 
30.1920 
60.0631 


2.91; 
-I.3I; 
_2.73J 

"2.9ii' 
-1.31; 


■"2.91; 
-1.31; 
2.73 


1035.2 


The  sum  of  squares  for  the  maternal  effects  is  computed  in  a  similar 
fashion: 


S.  Sqs. M  =-[-1.53     1.38     -5.99] r  .0311x36     -.010U52     -.0111il2l  "^ 

'     ,010ii52       .0351;96    -.01l509J 
,0111il2     -.011509       .033283] 

r-1.53    1.38    -5.89I    rU7.7l;36      21.8113 

21.896]i 


23.9123 

21.696U 
Ii5.8l60 


^183.7916    -10l;.8031    -276.225q] 


-1.53 
1.38 

-  1763.5 

The  sum  of  sqxiares  for  specific  combining  ability  is  computed  similarly: 
S.  Sqs. ~C  -1-1.33     -.22(1    .OlMn5     -.007926] 


f-1.33  -.22jr^ 


^                  6l-l  [-1.331 

.007926       .015937]  L-.22J 

-[-1.33    -.22]    r95.1;826      hlMSi]  Pl.33l 

^          -"  \^nim  .86.3639]  L-.22J 

-^37.U389    82. 15713  [-1.33] 
-  200.9 
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The  completed  analysis  of  variance  is  presented  below: 
Source  of  Variation  d.f .    Sum  of  Squares   Mean  Squares 


General  combining  abilities  (G)  3 

Maternal  abilities  (M)  3 

Specific  combining  abilities  (C)  2 

Error  122 


103^.2 

1763,5 

200.9 

5786.3 


3U5.1 
587.8^ 
lOO.U 
hl.h 


Since  the  specific  combining  ability  effects  are  not  significant,  these 
equations  could  be  deleted  from  the  model  as  were  the  equations  for  the  sex- 
linkage  effects.  Assuming  the  specific  combining  ability  and  the  sex-linkage 
effects  to  be  zero,  the  mean  square  for  error  is  the  appropriate  term  for  test- 
ing the  significance  of  the  general  combining  ability  effects  as  well  as  the 
maternal  ability  effects  when  a  random  model  is  considered.  Should  the  line 
effects  be  considered  fixed,  as  in  crossing  breeds  of  cattle  where  similar 
samples  of  the  breeds  could  be  selected  for  i-epeated  experiments,  the  mean 
square  from  the  error  line  would  be  appropriate  for  testing  all  effects. 


Estimation  of  Variance  Components 

Variance  component  estimates  can  be  calculated  readily  in  this  analysis  by 
making  use  of  the  inverse  elements  of  the  segments  which  were  inverted  to 
obtain  the  sums  of  squares  to  calculate  the  coefficients  for  the  variance 
components.  The  coefficient  for  the  major  variance  component  in  each  line  of 
the  analysis  of  variance  is  computed  by  means  of  the  general  formula 


\ 


k  - 


i 


-  d.f.  £2Z^J 


m 


presented  in  the  section  on  the  two-way  classification  without  interaction, 
The  expectations  of  the  mean  squares  are  shown  below. 


Source  of  variation  d.f. 

General  combining  abilities  (G)  3 

Maternal  abilities  (M)  3 

Specific  combining  abilities  (C)  "2 

Error  122 


E(MS) 


o^  +  k3  og  +  k^  o| 


\- 


«.-    o  ^  ^^^r^H     D*i  r:-,  nr  AA    liflltl 
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utilizing  the  inverse  elements  of  the  matrix  which  was  used  to  calculate  the 
sum  of  squares  for  specific  effects, 

181.8U65  -  Ii7.1i867      ^^  ^„ 
k     «=  =  22.39 


and 


s§ 


IOOJ4  -  hl.k 
22.39 


2.1i  . 


Performing  the  same  calculations  with  the  inverse  elements  of  the  matrix 
used  to  calculate  the  s\jm  of  squares  for  maternal  effects, 

135.2^38  -  U5.0800       ^^    ., 


and 


^ 


587 >8  -  U7.U 
22.51; 


2I1.O  * 


The  inverse  elements  used  in  the  computation  of  the  sum  of  squares  for  the 
general  combining  abilities  are  then  used  to  calculate  ki  ,  as 

178.2813  -  58.6273      ,^  ^, 
k^  =  -T •  -  29.91  . 


The  coefficient  for  the  component  of  of  appearing  in  the  E(MS)  for  the 
general  combining  abilities,   ko,  may  also  be  determined  from  the  matrix  used  to 

compute  the  sum  of  squares  for  general  combining  abilities,  when  all  subclasses 
are  filled.     The  formala  for  calculating  this  value  in  this  case  is 

i  '^•^-  ij 

k»   »  — — — 


Utilizing  this  formula. 


m(m-2) 

178.2813  -  58.6273 
8 


lli,96 


and 


^4 


3U5.1  -  n.h.  -  (lli.96)(2.1.). 

29.91 


8.8 


Mean  Separation 

Since  significant  differences  were  found  among  the  maternal  effects,  mean 
separation  procedures,  such  as  Duncan's  Multiple  Range  Test,  can  be  applied  to 
determine  which  means  differ  significantly  from  one  another.  If  the  lines  or 
breeds  were  considered  as  fixed,  significant  differences  would  also  have  exist- 
ed among  the  general  combining  ability  effects.  Since  the  mean  separation 

-I38-. 
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procedures  are  the  same  for  all  effects,  the  test  -will  be  used  only  for  the 
general  combining  abilities.  The  computational  details  regarding  this  test 
were  discussed  in  both  the  first  and  second  sections,  so  the  results  of  this 
test  are  presented  in  Table  18  without  further  explanation. 

Table  18.  Mean  separation  for  general  combining  ability 
effects  mth  Duncan's  Multiple  Range  Test  ( .0^   level) 

Comparison  (y±-y^)   VLii^. 


C^^+C^^-2C^J 


Product 


differences 


of  Zpn2 


gl  ▼s   g^ 
gl  vs   g^ 

.21 
ii,28* 

5.610 
5.357 

g^rs  g^ 

7.27* 

5.366 

S  "  '2 

g^^s  g^ 

li.07» 
7  .06* 

5.ii75 
5.1i72 

^2  "^  \ 

2.99 

5.U99 

1.18 

15>.28 

22.93 

20.31 

39.01 

21.00 

22.28 

1^.28 

38.63 

20.31 

I6.hh 

19.28 

Uae  of  the  Transformation  Matrix 

The  writing  out  of  the  coefficient  matrix,  the  subsequent  subtractions 
performed  in  imposing  restrictions  on  the  constant  estimates,  and  the  final 
inversion  of  the  matrix  is  a  time-consuming  process.  Much  of  the  work  can  be 
eliminated  if  a  transformation  matrix  is  available,  or  can  be  written  out 
quickly,  as  is  possible  with  many  designs.  Advantage  can  then  be  taJcen  of  the 
fact  that  the  inverse  of  the  matrix  of  subclass  numbers  (a  diagonal  matrix)  can 
be  obtained  readily  as  this  is  nothing  more  than  the  reciprccals  of  the  sub- 
class numbers.  In  addition,  the  constants  are  merely  the  subclass  means. 
Transformation  of  the  inverse  of  the  subclass  numbers  then  yields  the  inverse 
matrix  that  would  have  been  obtained  under  the  methods  outlined  in  this  section. 
This  transformation  is  accomplished  by 

vhere  K  ■  the  transformation  matrix,  D"*l  «  the  inverse  to  the  matrix  of  sub- 
class numbers,  and  K'  is  the  transpose  of  the  transformation  matrix.  Trans- 
formation of  the  constant  estimates  (the  subclass  means)  then  yields  the  con- 
stant estimates  that  would  have  been  obtained  by  the  methods  of  this  section. 
Transformation  of  the  subclass  constant  estimates  is  accomplished  by 

KB 

where  K  is  the  transformation  matrix  and  B  is  the  vector  of  subclass  means. 

The  transformation  matrix  for  the  model  under  discussion  and  the  general 
formulas  for  writing  out  the  transformation  matrices  for  single  cross  data 
involving  more  than  four  lines,  when  the  linebreds  have  not  been  included,  havo 
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been  gLven  by  Harvey.  15/  The  transformation  matrix  for  the  model  under  discus- 
sion, the  subclass  means,  the  reciprocals  and  the  transformed  constant  estimates 
are  presented  in  Table  19  • 

Table  19«  The  transformation  matrix  (K),  the  subclass  means  the  reciprocals  of 
the  subclass  numbers,  and  the  transformed  constant  estimates. 

Constant 
Estiniates 


Sl2Sl3Sil;S2iS23S2li33iS3233[^s|j_s^2SU3    Means 


Reciprocals 


V- 

'2 

2     2 

2 

2 

2     2 

2 

2 

2 

2 

f 

63.?33 

.0833333 

61.378 

H 

6 

6    6 

0 

-3 

-3     0 

-3 

-3 

0 

-3 

62.100 

.1000000 

2.9lil 

§2 

0 

-3  -3 

6 

6 

6-3 

0 

-3 

-3 

0 

-3 

65.625 

.0833333 

-1.29li 

^3 

-3 

0  -3 

-3 

0 

-3    6 

6 

6 

-3 

-3 

0 

59.333 

•0833333 

2.753 

m^ 

-6 

-6  -6 

6 

0 

0    6 

0 

0 

6 

0 

0 

59.7iiO 

.1000000 

-1.61a 

m|¥ 

6 

0     0 

-6 

-6 

-6    0 

6 

0 

0 

6 

0 

61.050 

.1000000 

I.U26 

"^3 

0 

6    0 

0 

6 

0  -6 

-6 

-6 

0 

0 

6 

6I1.275 

.0833333 

-5.78U 

^12 

U 

-2  -2 

1; 

-2 

-2  -2 

-2 

h 

-2 

-2 

L 

61.1^2 

.0833333 

-1.285 

^13 

«2 

U  -2 

-2 

-2 

li     h 

-2 

-2 

-2 

u 

-2 

66.6SQ 

.0833333 

-.172 

^12 

6 

-3  -3 

-6 

3 

3    3 

-3 

0 

3 

-3 

0 

61M6 

.071)i?R6 

.767 

^13 

-3 

6-3 

3 

-3 

0  -6 

3 

3 

3 

0 

-3 

57.I1OO 

.1250000 

.981; 

^23 

-5 

-3    0 

-3 

6 

-3     3 

-6 

3 

0 

3 

-3^ 

50.1-50 

.0833333 

1.229 

The  transformed  constant  estimates  shoi^m  in  the  last  column  of  table  8 
ware  computed  as  foUovs: 

^   (2)(63.933)  +  (2)(62.1CO)  +  +  +  (2)(50.ii50) 


2U 


=  61.378 


.   (6)(63.933)  ^  (6)(62.10Q)  +  -t-  .  (3)(5o.l5o)    „  .  ^ 
^  2I4 


;:   (3)(63.933)  -  (3)(62>m30)  -^  -f  -  (3)(50.l-5o) 
^^  2U 

These  constant  estimates  are  the  same  as  those  presented  in  table  It  and  thus 


15/    Harvey,  Walter  R.     Ar-alysis   of  data  with  isiequal    subclass   pimbers 
when  a  '8«t  of  orthogonal  comparisons   among  the    subclass   neans  is    desired. 
Paper  presented  in  the  Biometrics  Society  Section  of  the  AIBS  meetings  at 
Pennsylvania  State  University,  August  31,   1959. 
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are  the  same  as  would  have  been  obtained  by  the  methods  outlined  previously  in 
this  section.  The  inverse  elements  are  obtained  as  follows: 


C^  ^   (2)''(>0833333)^(2)^(a000000)+  +  +(2)^(. 0833333)    ^  .007l;98 

(2li)2 

C^&2.   =  (^)(^)^«OQ33333H(2)(6)(.10QOOOO)4-  ^      -   (2)(3)(.0833333)  ^  ^^^qq^^^ 

(2ii)2 


csiei 


(6)^(>0833333)+(6)^(a000000)  +  +  -t-  (-3)^(>08  33333) 

576 


•025651 


Cgigi  =  (6)(-3)(a000OOO)-(6)(3)(>0833333)  +  +  4-(3)(3)(»0833333)  ^  ^^0053^^ 


576 


etc. 


"While  the  transformed  inverse  matrix  C-^  can  be  calculated  on  a  desk 
calculator,  these  matrix  multiplications  can  be  completed  readily  on  high 
speed  computers,  saving  considerable  time.  Upon  completion  of  the  inverse 
matrix,  the  analysis  would  proceed  as  outlined. 

SIMULTANEOUS  ANAL:y5IS  OF  DATA  FROK  GROSSES  MID   PUHE5REDS  I 

When  it  is  possible  to  include  linebreds  or  purebreds  in  the  design  of  a 
single  cross  experiment  an  estirrate  of  heterosis  is  then  available.  The  line- 
bred  effects  mav  be  estimated  independently  of  or  simultaneouslv  with  the 
general  combining  ability  and  maternal  effects.  The  analysis  to  be  considered 
in  this  section  includes  the  linebreds  in  the  exr>eriment  but  estimates  these 
effects  independentlv  of  the  general  combining  abilitv  and  maternal  effects. 


Model 

The  mathematical  model  underlying  this  analvsis  is 
yhijk  -  Ji  +  ay,  +  piii  +  g2±   ""  g2j  +  "»2j  +  <=2ij  "^  ^21^  +  ehijk 


(8) 


where  yhijk  "  the  k'th  observation  on  the  progeny  of  a  mating  of  a  dam  from  the 
j'th  line  with  a  sire  of  the  ith  lir.e  in  the  hth  fvpe  of  breeding  (purebred  or 
crossbred);  \i   «  the  over-all  mean  when  equal  subclass  freouencies  exist  and 
there  are  equal  frequencies  of  linebreds  and  crossbreds;  ah  "  ar^  effect  com- 
mon to  all  progenv  of  the  h^^  tvpe  of  breeding  (linebred  or  crossbred),  the 
difference  between  these  effects  being  an  estimate  of  heterosis;  and  Pi^i  = 
an  effect  common  to  all  progenv  of  a  mating  between  a  dam  of  the  i'th  line  and 
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a  sire  of  the  i'''^  line.  The  rerraining  symbols  represent  the  same  effects  as 
described  in  Jiiodel(7).   The  subscript  1  denotes  that  the  effects  are  measured 
only  among  the  progeny  of  the  first  type  of  mating  (linebred)  and  the  sub- 
script 2  denotes  that  the  effects  are  measured  only  among  the  progeny  of  the 
second  type  of  mating  ( crossbred), 

Least-Squares  Eqiiations 


The  least-squares  equations  for  this  model  are  presented  in  Table  20.  It  ""  I 
can  be  seen  from  these  equations  that  while  the  sums  of  the  coefficients  for  "  -  I 
the  heterosis  effects  equal  the  coefficient  for  p,  in  the  p.  equation,  none^of     -  ^  \ 

the  sums  of  coefficients  for  the  other  effects  equal  the  coefficient  for  \i.  j 

Also,  in  the  sums  making  up  the  RHM's,  only  the  sum  of  the  RPIM's  for  the  | 

heterotic  effects  equals  the  sum  for  the  u  equation.  Since  the  linebreds  are  | 

not  used  in  estimating  the  general  combining  abilities  or  maternal  abilities,  | 

and  are  not  irrvolyed  in  estimating  specific  or  sex-linkta  effects,  several  | 

cells  in  the  table  contain  no  entries.  Again,  it  is  noted  that  in  the  seg-  I 

Bient  of  the  coefficient  matrix  corresponding  by  row  and  column  to  the  general  | 

combining  ability  effects,  there  are  entries  in  the  off -diagonal  elements.  | 
In  the  segments  corresponding  to  the  other  effects,  only  zeros  apoear  in  the 
of f-dia gonals , 

Imposing  Restrictions  on  the  Constant  Estimates 

The  restrictions  imposed  on  the  constant  estimates  with  this  nxDdel  are 
similar  to  those  of  model  (7)  except  that  the  tyoe-of -breeding  effects  and  the 
purebred  effects  must  be  taken  into  account.  VJhile  many  restrictions  are 
possible,  the  following  are  convenient  for  this  analysis: 

2Sh  -  piii  -  2g2i  =  2m2j  «Zcij  =  Zc^^  =  ZL-   Z^^  =   Zrij  =  Zr^i  =  ?ij+?ji  -  0. 

These  restrictions  force  the  constant  estimates  for  type-of-breeding-effects 

to  sum  to  zero  about  jl  the  constant  estimates  for  purebred  effects  to  sum  to  I 

zero^about  (1  +  ai ,  and'the  remaining  constant  estimates  to  sum  to  zero  about  | 

\i   +  &2*     '^^  restrictions  on  the  constant  estima.tes  for  the  ai^  and  p^ii  ^^^  \ 

effected  by  subtraction  bv  row  and  column  as  discussed  in  the  first  section.  I 

The  remaining  restrictions  are  imposed  by  the  same  orocedures  as  outlined  for  | 
model  (7).                                                           "   '   I 

Inversion  of  the  Reduced  Matrix  "  ^ 

After  the  restidctions  discussed  above  have  been  imposed  on  the  constant 
estimates,  there  will  remain  a  symmetric  matrix  of  order  p^,  where  p  =  the 
number  of  lines  included.  Inversion  of  a  matrix  of  this  order  is  o-f  course 
most  readily  accomplished  on  a  high  speed  comcuter. 
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Computing  the  Least-Squares  Means  and  Standard  Errors 

The  constant  estiniates  for  the  least-squares  mean  and  all  effects  in- 
cluded in  the   model  can  be  obtained  by  solving  the   equations  together  with-  the 
Inversion  of  the  variance-covariance  natrijc.     If  this  is  not  done,    then  the 
constants  can  be  estimated  from  the  inverse  elements  and  the  RHM's  of  the 
equations  by 


Z   Ci^Tj  =  c^ 


where  C^J  is   the  inverse  element  for  the  i'th  row  and   jth  column  of  the  matrix 
inverse,   Yj  is   the  RHM  for  the  j'th  row  and  ci  is  the   estimate  of  the  ith  con- 
stant.    Since  the   constant  estimate  for  the  type-of-breeding  effect  is  deter- 
mined as  a  deviation  from  jl,    the  least-squares  mean  for  the  purebreds  is 
given  by 

and  the  least-squares  mean  for  linecrosses  or  crossbreds  is   given  by 

Ii  +  a2 
The  least-squares  mean  for  a  purebred  group  is  then 

il  +  ai  +  piii 
and  the  least-squares  mean  for  a  linecross  group  is 

II  +  a2  +  g2i   +  g2J  +  hi  ^  ^2ij  "^  ^21^   . 
The  standard  errors  for  these  means  are  determined  as  follows: 


sffi+ai)   -  V  0^  *  C^l^l  H.  2(^1     ae 


s(ji+a2)  -  V  CW^  +  C^2a2  +  2C^2     ^^ 


«(;i+ai+^..)  =  >/  OHi  +  C^l^l  +  cPlii  +  20^1  +  2C>^il+  2C^lPlii     % 

(^i+a2+g2i+g2j+m2j+C2ij-^r2ij)       ^^         ^  +  o     j     j 

+  C«^2ij<^2iJ  +  C^2ijr2ij  ^  3^2  +  2CJ^g2i  +  2d^g2j  +  20^^^:!  +  ZC^^^^^i 
+  20Ar2ij  +  2C^2g2i  +  2C^2g2j  +  2C^2»n2j  ^  2C^2C2ij  +  2C^2r2ij 

-  2Cg2ig2J  +  2Cg2i"2J  +  2Cg2iCiJ  +  2Cg2ir2iJ  +  2Cg2jin2J  +  2Cg2jC2iJ 


-llOi. 
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J. 

+  2CS2jr2iJ  +  2C^J^2ij  +  2C^2jr2ij  ^  20^21^^21^^^   ^^ 


where  the  C's  denote  elements  of  the  matrix  inverse  to  the  variance-coyariance 
matrix,  the  superscripts  designate  the  partictilar  elements  and  g^   is  the  error 
standard  deviation.  While  the  standard  errors  for  the  over-all  mean,  pure- 
breds,crossbreds  and  piirebred  groups  can  be  obtained  readily,  it  is  obvious 
that  considerable  effort  is  involved  in  obtaining  the  standard  errors  for  the 
crossbred  groups  by  tiie  use  of  the  formula  given  above.  A  shorter  method  of 
obtaining  these  constants  will  be  presented  later. 

The  error  variance,  Oq,   is  normally  obtained  by  the  following  formula 

222Z  ygiik  -  R(|J.,a,g,m,c,r) 

^   .  hi.i^    ^ 

e 

d.f ,  for  error 

where  Z222  yh±-^]r   =  the  total  uncorrected  sum  of  squares,  and  R(|a,q,g,mj,c,r)  ■ 

hijk  '  '^ 
the  totiil  reduction  in  sum  of  squares  due  to  fitting  all  constants.   The  total 
reduction  in  sum  of  squares-is  obtained  by  the  same  procedures  as  presented 
for  model  (7). 

Computing  Sums  of  Squares  for  the  Analysis  of  Variance 

The  sums  of  squares  are  obtained  by  the  Drocedures  discussed  previously, 
applying  the  formula 

B»Z-^  B 

irtiere  B'  is  a  row  vector  of  the  constant  estimates  for  the  effect  considered, 
B  is  a  column  vector  of  these  constant  estimates,  and  Z  is  the  segment  of  the 
matrix  inverse  to  the  variance-covariance  matrix  corresoonding  by  rows  and 
columns  to  the  particular  set  of  effects.  The  sums  of  squares  may  also  be 
obtained  by  differences  in  reductions. 

The  breakdown  of  the  degrees  of  freedom  for  the  analysis  under  model  8 
are 

Source  of  Variation  Degrees  of  Freedom  \ 

Total  n. ..  -  1 

Heterosis  1 

Among  purebred s  p-1 

General  combining  abilities  p-1 

Maternal  abilities  p-1 

Specific  combining  abilities  P^^~3) 

Sex-linked  effects  P^g'^)  +3^ 

Error  n*««  — p 
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where  p  is  the  niunber  of  lines  included  in  the  experiment. 


Construction  and  Apolication  of  Transformation  Matrices 

The  analysis  of  data  of  the  type  described  by  model  (8)  can  be  greatly 
facilitated  by  the  use  of  a  transformation  matrix.  The  method  of  constructing 
a  transformation  matrix  to  be  presented  here  for  model  (8)  is  a  general  one 
and  can  be  followed  for  other  least-squares  analyses.  As  a  first  step,  the 
least-squares  coefficient  matrix  is  written  out  following  the  equations  de- 
scribed for  this  model,  considering  only  one  observation  in  each  of  the  p*^ 
subclasses.  Imposing  the  suggested  restrictions  on  the  constant  estimates  re- 
sults in  the  reduced  least-squares  coefficient  matrix  presented  in  Table  21. 


Table  21.  Reduced  least-squares  coefficient  matrix  with  on6 
observation  per  subclass  for  model 
yhl^k   »  fi  -»■  a^  +  piii  +  g2i  +  g2j  "^  ^j  C2ij  +  r2i-j  +  e^ijk 
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This  matrix  is  inverted  reaifily  since  each  of  the  five  small  segm.ents  can  be 
inverted  separatelv  to  yield  the  matrix  inverse  shown  in  Table  22.  The  next 
.fetep  is  to  write  out  algebraically  the  subclasses  which  are  used  in  measuring 
each  of  the  effects  included  in  the  model.  Imposing  the  appropriate  re- 
strictions on  these  values,  the  following  RHM's  are  obtained: 
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H:  ZZsij 
^1*  ^2sij  -  ?2sij 
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Pin-  sii  -  si,[j 

P133-  ^33  "  %U 
£21 :    (2sij  +  Zsii)  -  (Zsj^j  +  Zs^i^) 

622-  (252 j  +  ?Si2)  -  (Zsj,^  +  Zsi^) 
£23:  (Zso^  +  Zsi3)  "  ^-^^^j  ■"  ?^iii^ 
^21*  ?Sii  -  Zsii^ 


^. 
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^23-  |Si3  -  |Sili 
Table~22,     Matrix  Uiverse  to  least-squares  coefficient  matrix 
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^212-  si2  -  s-^l 


-  Si.o  -  S 


^li2 


21  ■"  ^21;  "■   =[a 


^213'  ^13  ~  ^lli  "  ^31  *  ^31  ■"  ^lil  -  ^ii3 

^223*  ^23  -  ^2li  -  ^32  ^   ^31  "^  ^li2  "  %3 

where  Sj_j  refers  to  the  subclass  for  the  progeny  resulting  from  a  mating  of  a 
sire  of  the  i^h  i-jj^g  -with  a  dam  of  the  jth  line. 

The  third  and  final  step  is  the  multiolication  of  the  RHM's  given  above 
by  the  matrix  inverse  to  the  least-squares  coefficient  matrix  shovm  in 
Table  22,  as  follows: 


i=J 


i+J 


2h  ij 


ai:  _L  Z2si4  +  1_  {ZXs±a  -  lZs±^)  =  3-  ZZsi^  -  1  ^2si  ^ 

2UiJ^   12  ij  ^   ^'0  ^    2lli.i^   2i:ij^ 

i«j     i+j        i=j       i+j 

Pill-  J  (sii-s^i,)  -  jj  (s22-si,l,)  -  j;  (s33-si,i,)  «  ^  sn-i  522-^533.^3^^^ 
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•   \      •',: 


1  1 

^223-  8  (n2-siU-sl;2-S2i+S2i,-Hsj^}  --  (^2'^lh'^31*^3h^%l'\3^ 

*  1:  (s23-S2h--32*S3l:-s^2-s^3)  =  |  ^12'  |  ^13-  |  ^21*  ^  ^23"  |  ^2h 


*  8  31'  I  ^32*  8  ^3^+  gSi,2  -  o  s 


8  ^^3 


The  values  resulting  from  the  above  multiplications  are  the  coefficients 
for  the  transformation  matrix,  K,  which  is  shown  in  Table  23.  If  D  is  the 
least-squares  coefficient  matrix  for  the  subclasses,  then  the  matrix  inverse, 
C"l,  to  the  least-square  coefficient  matrix  for  model  (8)  is  obtained  by 

KD-1  K' 

where  K  is  the  transformation  matrix,  D~l  the  matrix  inverse  of  D,  and  K*  l«  the 

transpose  of  the  trai3slbmHtiDnmatrix;  Since  D  is  a  diagonal  matrix,  its  inverse  can  be 
obtained  quickly  by  calculating  the  reciprocal  of  each  diagonal  element.  The 
constant  estimates  are  computed  by 

IS  '  B 

where  S  is  a  column  vector  of  the  subclass  means  and  B  is  the  column  vector 
of  the  constant  estimates. 
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Table  23.   Transformation  inatrix  for  model.  y>iijk=M-"*"ah"'"Plii"''S2i'*'g2J+"'2 j+*^^ii 
■*"^2ii'*'®M-'k*  Purebred  effects  estiinated  independently  of  general  combining 
ability  and  maternal  ability  effects. 
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It  can  be  seen  that  if  a  simple  transformation  matrix  is  available  for 
the  model  desired,  a  great  deal  of  labor  can  be  saved  in  this  type  of  an- 
alysis. While  many  transformation  matrices  can  be  written  readily  without 
using  the  procedure  outlined  above,  those  for  more  complex  designs  can  be  ob- 
tained in  this  manner.  As  had  been  pointed  out  previously,  transformation 
matrices  are  also  useful  when  making  orthogonal  comparisons  among  the  con- 
stant estimates  of  treatment  effects,  where  one  is  concerned  with  only  a,  seg-    ;^ 
msnt  of  the  entire  matrix  inverse,  and  only  those  estimates  of  the  constants  N^~ 
for  the  treatment  effects  under  consideration.  ^'^ 
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Nmnerical  Examnle 

The  data  used  in  the  numerical  exaTrole  are  those  used  nreviously  for 
model  (7),  with  the  addition  of  the  linebreds,  which  were  included  in  the  ex- 
periment but  omitted  for  the  previous  example. 
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Least-Squares  Analysis 

The  least  squares  equations  are  shown  in  Table  2L  where  it  can  be  seen 
that  onlv  the  segment  corresponding  by  row  and  column  to  the  general  com- 
bining ability  effects  contains  entries  in  the  off-diagonals.  All  other  seg- 
ments corresponding  by  row  and  column  to  one  of  the  effects  being  estimated 
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contain  entires  in  the  diagonals  only.  It  can  also  be  observed  that  the  sum 
of  the  coefficients  for  the  general  combining  ability  effects  (in  all  equa- 
tions in  which  there  are  entries)  equals  twice  the  coefficient  for  p.,  except 
in  the  [i   equation  due  to  inclusion  of  purebreds  in  the  analysis. 

After  imoosing  the  restrictions  on  the  constant  estimates  suggested  for 
this  model,  the  coefficient  matrix  can  be  inverted  and  the  inverse  elements 
necessary  for  estimating  constants,  standard  errors,  and  calculating  sums  of 
squares  for  the  anal^ASis  of  variance  can  be  obtained.  The  constants  may  be 
estimated  during  the  process  of  inversion. 

When  a  transformation  matrix  is  available  the  inverse  elements  and  the 
constant  estimates  can  be  obtained  more  readily  than  by  the  inversion  of  the 
least-squares  coefficient  matrix.  Using  the  transformation  iratrix  which  has 
been  presented  for  this  model,  the  desired  matrix  inverse  to  the  least- 
squares  coefficient  matrix  can  be  calculated  by  means  of  a  desk  calculator  in 
a  very  short  time.  The  constants  estimates  are  also  computed  simroly  and 
quickly. 

Since  the  same  observations  were  utilised  in  estimating  the  general  com- 
bining ability,  maternal,  specific  combining  abilitv,  and  sex-linked  effects 
as  in  the  previous  model,  the  inverse  elements  pertaining  to  these  effects 
and  the  constant  estimates  are  the  same  as  were  obtained  in  the  previous  an- 
alysis, within  rounding  error.  With  the  inclusion  of  the  linebreds  in  this 
analysis,  additional  constant  estimates  are  obtained  for  heterotic  and  pure- 
bred effects.  The  constant  estimates  resulting  from  the  analysis  under 
model  (8)  are  shown  below: 


ji   -    60.322 

g22   -    -1.293 

013    "    -•^''2 

t-L    -    -1.057 

g23    -      2.753 

ri2   ■      .766 

mi  -   -3.310 

ih2i    ■    -I.6I1I 

ri3    -      .982 

P122   *      7.081 

2122    "      I.U26 

r23   -    1.229 

P133   -    -5.U73 

^23    ■    -5.78a 

g2i   -      2.9la 

C12    -    -1.285 

The  remaining  constant  estiirates  (pil^Jj,  g2i^,  etc.)  can  be  obtained  by  con- 
sidering the  restrictions  that  were  imposed  on  the  estimates  and  m.aking  the 
appropriate  additions  and  subtractions.  Due  to  the  size  of  the  matrix  in- 
verse, this  is  not  presented.  ^ 

The  total  reduction  in  sums  of  squair^^f^is  calcxilated  by  the  usual  pro- 
cedures of  obtaining  the  sum  of  the  oroducts  of  the  constant  estimates  by  the 
corresponding  RHM's 

2(|j.,a,p,g,n,c,r)  =  779,783. ijli 

This  value  can  also  be  obtained  %r  c^ilculating  the  uncorrected  sums  of  squares 
for  the  subclasses.  y  , 

The  error  sum  of  squares  is  then  calculated  as  the  difference  between  the 
total  uncorrected  sum  of  squares  and  the  reduction  due  to  fitting  all 
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constants. 


Error  sum  of  squares  =  793,111.89  -  779,783. hli  =  13,328.L5. 


The  sums  of  squares  for  heterosis,  purebreds,  general  combining  ability, 
maternal,  specific  combining  ability,  and  sex-linked  effects  are  obtained  by 
the  procedxires  previously  outlined,   Comoletion  of  these  scorns  of  squares  re- 
sults in  the  following  analysis  of  variance: 

Source  of  Variation 

Heterosis  (A) 

Purebreds  (P) 

General  combining  abilities (G) 

Jfeternal  abilities  (M) 

Specific  combining  abilities(C) 

Sex-linked  effects  (R) 

Error 

In  view  of  the  small  sum  of  squares  for  the  sex-liried  effects,  these 
effects  could  be  eliminated  from  the  model  and  new  estiirates  computed  for  the 
remaining  effects.  Should  the  specific  combining  ability  effects  prove  to  be 
non-significant  after  the  new  estimates  have  been  made,  these  effects  could 
also  be  eliminated  from  the  model  and  more  efficient  estimates  made  of  the  re- 
maining effects. 

If  standartH  errors  are  desired,  they  can  be  computed  from  the  formulas 
previously  presented,  as  follows: 


d.f. 

Sums  of  Squares 

Kean  Squares 

1 

192.93 

192.93 

3 

1878.50 

626.17 

3 

lOla.I-h 

3ii7.l5 

3 

1725.26 

575.09 

2 

180.73 

90.36 

3 

267.73 

89.2li 

192 

13328.1.5 

69.  h2 

sJL  »  V  .00576991;  V  69.1i2  =  (.0761) (8. 33)  =  .63 

s(^+a3_)  -  >/  .0057899h  +  .0057899ij  +  2(. 00201077)  (8.33)  =  (.125)(8.33) 
=  l.Oit 

S(ji+t3^+^^)  =  r.0057899li  +  .00578991  +  .03838870  +  2(  .00201-077) 

+  2(-.00211i889)  +  2(-,0021ii889J   (8.33)  =»  (.213) (8. 33) 
-  1.77 

The  standard  error  for  a  linecross  or  crossbred  group  can  most  easily  be 
obtained  from  a  consideration  of  th^'  diagonal  matrix  made  up  of  the  subclass 
(linebred  and  crossbred  groups)  nl^iers.  Each  of  these  subclasses  contain 
the  \L   effect  plus  subclass  effect,  so  the  inverse  element  from  this  diagonal 
element  corresponding  to  a  subclass,  Sj^j,  is  the  element  needed  to  determine 
the  standard  error,  sp^+g.  .,  To  calculate  the  standard  error  of  the  cross  be- 
tween a  sire  of  the  1st  Ixne  and  a  dam  of  the  2nd  line,  the  subclass  number 
is  12  and  its  inverse  1^2.  Therefore,  the  standard  error  for  this  crossbred 
or  linebred  group  is  4 


^J.2 


'^ 


y/  SS.hZ  ^  V  5.7850  -  2.1a 


- 1 


I 


-152- 


...^«3»»  rhi  vh;«  itm'r%  Lir^v  iinniijitiiift  Convrioiit  intrinaomsnt 


IT' 


7:\ 


The  data  which  have  been  used  in  this  analysis. could 'also  have  been  an- 
alyzed using  the  model  « 

yijk  -  VI  +  «i  +  dj  +  (sti)ij  +  cijic (9)^ 

where  Jijlc  ^  "the  observation  on  the  k'th  offspring  of  a  mating  between  the  j'th 
dam  and  x^^   sire.  While  an  analysis  of  this  model  does  not  ^rleld  the  de- 
tailed information  of  the  urevious  analysis,  it  does  offer  a  measure  of  the 
over-all  performance  of  the  lines  as  sires  and  dams.  The  equations  for  an 
analysis  under  this  model  would  be  written  in  the  usual  way  for  a  two-way 
classification  with  interaction.  Again,  the  availability  of  a  transformation 
matrix  lessens  the  labor  consideraly.  The  transformation  matrix  for  model  (9) 
is  shown  in  Table  2$   for  the  case  of  four  lines. 

Table  25«  Transformation  matrix  for  model  yijk  "  M''*"Si"'"Clj'^(sd)|^i+e^i|^, 

s^3  sii 


SI 
S2 
S3 
di 
d2 

'^1 

sdi2 
sdi3  .  .< 
sd2i 

sd22, 

sd^3 

sd3i 

sd32 

sd33 


sii  S12 

"l  1 

3  3 

-1  -1 

-1  -1 

3  -1 

-1  3 

-1  -1 

9  -3 


-1 
3 


-3 
-3 
-3 

1 

1 

-3 

1 
1 


9 
-3 

'^ 

-3 

1 

1 

-3 

1 


S13  ^h  ^21 

111 

3       3 

-1     -1 

-1    -1     -1 

-1    -1      3 

-1    -1     -1 

3    -1    -1 

-3    -3     -3 

-3    -3 

9    -3 

1      1 

1 

1 

1 

1 

1 


S22  S23 

1  1 

-1  -1 

3  3 

-1  -1 

-1  -1 

3  -1 


1 
-3 

1 

1 

-3 


1 

1 

9 

-3 

-3 

-3 

1 

1 


-1 

1 

-3 

1 


3 

1 

1 

-3 


-3  -3 

9  -3 

-3  9 

1  1 

-3  1 

1  -3 


S2lj  S31 

1  1 

-1  -1 

3  -1 

-1  3 

-1  3 

-1  -1 

-1  -1 

1  -3 

1  1 

1  1 

-0  -3 

-3  1 

-3  1 

1  9 

1  -3 

1  -3 


S32  S33 

1      1 

-1  -1 

-1  -1 

3      3 

-1  -1 

3  -1 


S3li  sui  SI,  2 

111 
-1  -1  -1 
-1     -1     -1 


3 
-1 


-1 

1 

-3 
1 
1 

-3 

1 


3 

1 
1 

-3 
1 

1 
-3 


-1  -1 

3  -1 

-1-1  3 

-1    -1  -1 

1    -3 

1 
1 

-3 

1 
1 


1 
1 
1 
1 
1 


-3     -3 

9    -3 

-3       9 


-3  -3 
-3  1 
-3      1 


1 

-3 
1 

1 

-3 

1 
1 

-3 

1 


1  1 

-1  -1 

-1  -1 

-1  -1 

-1  -1 

-1  -1 

3  -1 

1  1 

1  1 

-3  1 

1  1 

1  1 

-3  1 

1  1 

1  1 

-3  1. 
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An  analysis  of  the  data  of  the  present  example  using  model  (9)  yielded 
the  following  constant  estinates: 


5  -   60.850  33    -  -L.330 

$3^   -     1,053  £»^)xL    -  -5.361 

^2    «       .767  (sd)x2  ■  --^63 

%    *    X.I45U  (»<i)i3-  li.526 

%    -    -.588  (^).^i  -  -1.696 


(8^)23  -  2.li52 
(8d)3^  -  2.559 

(«d)32,  -  -.005 
(8^)33  -  ^a.l83 


The  remaining  constant  estiirates  for  sires,  dams,  and  interactions  can  be  ob- 
tained by. the  usual  procedures.  Total  reduction  in  sum  of  squares,  error  sum 
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of  squares,  and  the  suns  of  squares  for  sires,  dams,  and  interactions  also 
follow  the  methods  previouslr  outlined.  The  analysis  of  variance  following 
the  coiTiDletion  of  these  coinputations  is  shown  below: 


Source  of  Variation   d.f. 


Sires  (S) 
Dams  (D) 
S  X  D 
Error 


3 
3 

9 
191 


Sums  of  Squares 

658. [-9 

1L09.62 

1812.72 

13327.55 


Mean  Squares 

219.50 

169.87 

201.1-1 

69.78 


The  significant  interaction  of  sires  and  dams  is  not  unexpected  in  an  ex- 
periment of  this  tvpe  and  m.akes  the  analysis  under  model  (9)  of  very  limited 
value.  If  one  is  concerned  only  with  the  particular  crosses  produced  in  the 
experiment,  this  analysis  does  measure  the  value  of  the  lines  as  sires  and 
dams.  However,  there  is  no  information  as  to  what  type  of  gene  action  is  in- 
volved and  so  the  analysis,,  is  not  of  value  in  formulating  breeding  nlans 
where  one  wants  to  have  information  on  both  additive  and  non-additive  effects, 
such  as  are  brought  out  under  models  (7)  or  (8).  Since  the  interaction  is 
significant  it  is  of  importance  to  know  what  contribution  specific  combining 
ability  and  sex-liriced  effects  have  made. 

Analysis  of  the  data  under  model  (8)  yielded  an  estimate  of  the  over-all 
amount  of  heterosis  and  a  test  of  the  significance  of  this  effect.  The  use 
of  a  simple  transformation  matrix  makes  it  possible  to  obtain  estimates  of  the 
amount  of  heterosis  and  the  inverse  elements  necessary  to  test  the  signifi- 
cance of  the  heterosis  effect  in  each  cross,  and  also  to  make  testa  of  sig- 
nificance among  the  heterosis  estimates.  While  the  constant  estimates  and  the 
tests  of  significance  are  not  independent,  since  the  purebreds  or  linebreds 
are  each  used  three  times,  they  «r«  of  considerable  interest.  The  trans- 
formation matrix  is  shown  in  Table  26  together  with  the  transformed  matrix  in- 
verse of  the  subclass  numbers.  It  can  be  seen  from  the  transformation  matrix 
that  the  constant  estimate  for  heterosis  in  each  cross  is  a  contrast  between 
the  average  of  the  purebreds  and  the  average  of  the  reciprocal  crosses.  The 
ranked  constant  estimates,  with  tests  of  the  significance  of  each  estimate 
and  tests  of  the  significance  among  the  estiriHtes,  are  shown  below: 


Constant 

Tests  of  Si 

-gnificance 

Cross 

Estimate 
8.3II1* 

Among  Crosses 

13 

Hi 

5.095* 

23 

2.0U7 

3U 

1.175 

12 

.1j83 

2h 

-1i.1j31 

■\ 


'Hhe   tests  of  significance  of  the  heterosis  estimate  for  each  cross  was 
■anade  by  the  analysis  of  variance,  with  a  single  degree  of  freedom  for  each 
estimate.  The  sum  of  squares  for  each  contrast  utilizes  the  constant  es- 
timate ^and  the  inverse  element  of  the  transformed  matrix  corresponding  by  row 
^nd  column  to  the  cross  being  considered.  For  example,  the  sum  of  squares 
-for  the  estiirate  of  heterosis  resulting  from  crossing  line  1  and  line  3  is 


I ^. 
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S.Sqs. — -^13 


(6.3U)' 
'.07603030 


885. 8U 


Table  26.  Transformation  matrix  used  to  obtain  constant  estimates  of 
heterosis  effects  for  separate  crosses  and  the  transformed  matrix  inverse  of 
subclass  numbers. 


Sll  S12  «13  ^l  S2I   ^ 

22  ^^23  ®2li 

®31  ®32  "33 

=3li  ^Iil  %2  %3  ^lili 

^12* 

-.5     .5 

0       0     .5  - 

.500 

0       0       0 

0       0       0       0       0 

hi3: 

-.5     0    , 

,500 

0       0       0 

.5     0  -.5 

0       0       0       0       0 

^li- 

-.5    -0 

0     .5       0 

0       0       0 

000 

0    .5     0     0  -.5 

^23- 

0      0 

0       0       0- 

.5    .5     0 

0    .5  -.5 

00000 

^2U- 

0      0 

0       0       0- 

.5     0    .5 

000 

0     0    .5     0  -.5 

hW' 

0      0 

0       0       0 

000 

0     0  -.5 

.5     0     0    .5  -.5 

hi2 

bi3 

ftili 

h23 

^2li                ^23 

h.2 

.0626!i569 

.0113636h 

.0113636I 

.00961538 

.00961538      .00000000 

hi3 

hlli 
h23 

.0113636h 
.01136361 
.00961538 

.07803030 
.OII3636U 
.02083333 

.0113636I 
.070887lih 
.00000000 

.02083333 
.OOCOOOOO 
.07628205 

.00000000    .02083333 
.02083333    .02083333 
.00961538    .02083333 

I12U 

.(50961538 

.00000000 

.02083333 

.00961538 

.08669872    .02083333 

h3li 

.00000000 

.02083333 

.02083333 

.02083333 

.02083333     .08333333 

The  analysis  of  variance  is  shown  below: 

Source  oi 

'  Variation 

d.f.       Sum  of  Squares 

Mean  Scuare 

Heterosis 

Line  1  and  line  2 
Line  1  and  line  3 
Line  1  and  line  h 
Line  2  and  line  3 
Line  2  and  line  h 
Line  3  and  line  U 

Error                          ^ 

1 
1 
1 
1 
1 
1 
191 

3.72 

885.93 
366.16 

51.93 

226.51 

16.57 

13327.55 

3.72 
885.93^ 
366.16* 

5I1.93 

226.51 

16.57 

69.78 

I 


The  tests  of  significance  among  the  constant  estimates  for  the  separate 
crosses  were  made  by  means  of  Duncan's  Multiple  Range  Test,  the  details  of 
which  have  been  presented  previously. 

SI>mTANEOUS  AKALTSIS  OF  DATA  FROM  CROSSES  AND  PUHEBREDS  II 

The  analysis  of  the  data  under  model  (8)  estimated  the  purebred  effects 
independently  of  the  general  combining  abilitv  effects.  The  purebred  effects 
can  also  be- estimated  by  a  simultaneous  consideration  of  the  general  com- 
bining ability  and  maternal  effects  by  including  these  effects  in  the 

^155- 
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'rr'^_ ' 


\ 


equations  for  the  purebred  effects.  This  method  of  analysis  results  in  the 
saiTie  constant  estimates  and  sums  of  squares  for  all  effects  other  than  the 
purebreds.  /  £ 


/; 


Model 
The  mathematical  model  describing  this  third  method  of  analysis  is: 

yhijk  -  ti+ah+Piii+g.i+g.3+m,j+C2ij+r2ij+ehijk  ....   (10)   \ 


/ 


/ 


i- 


where  the  symbols  represent  the  same  effects  as  those  for  model  (8).  The  sub-    ' 

scripts  indicate  a  simultaneous  consideration  of  the  general  combining  ability, 

maternal,  and  Durebred  effects.  The  least-squares  equations  for  model  (10) 
are. presented  in  Table  27.  \ 


Completion  of  Analysis  \  .  I 

The  restrictions  imposed  on  the  constant  estimates  are  the  same  as  those    \  \ 
for  model  (8)  and  the  remainder  of  the  calculations  proceed  by  the  same 
methods  as  outlined  for  nwdels  (7)  and  (8).  No  numerical  example  is  pre- 
sented for  this  model  due  to  its  similarity  to  model  (8), 
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Errata  Sheet 

line  8  frcm  bottom,  "a^"  should  read  "^". 

2nd  line,  "n.a-  +  n  a  "  should  read  "n^a^  + 
2  2    p  p 


June,  1961 


near  bottom  of  page. 


-i 


2  -1 
b   2 


"^2  1        • 


•  •  *r  n  a  • 
P  P 


line  4  from  bottom,   "s  'V should  be  "s".  f   . 

line  7  from  bottom.  Circumflex  omitted  over  (i* 

line  4.  "(T^^i"  should  read  "2C^i". 

heading  at  top  of  page.   "Equation"  should  read  "Equations". 

^h;:  /4.0000\ 

under  (c).  Equal  sign  omitted  in  front  of  "1 6.2022 )"„ 

-^;  •  1^.4607/ 

line  4  from  bottom,  "likelihodd"  should  read  "likelihood'^, 
line  13  from  bottom,  "a  Model  I"  should  read  "as  Model  I", 
formula  for  S(b  ).  Place  "i"  under  2  -fi.^  -. 

formula  for  k,,.  Close  parenthesis.  V^r^l: 

line  13.^  Omit~"by".  ;•.  :T- -.■'■■- ^^'r^^r'^":;-'.-  '■ 

line  7.  Plus  sign  emitted, 
line  3.  Replace  +  at  end  of  line  with  «, 
last  line.  Change  "be"  to  "by". 

formula  for  of.  Place  brackets  around  "HZ   yf .,  -  R(y.,g,m.c,r)", 

Ijk  >-J* 

Table  13,  last  line.  "4,-"  should  read  "r,-". 

43  43 

line  11  from  bottom.  Place  ^   over  g,,  m,,  ^-it,}   ^^^j 

line  10  from  bottom.  Place  ^   over  c . , . 

Xable  18,  last  column  heading.  Replace  Z  with  z. 
line  1.  Place  ^   over  tr^. 


^=24  ^""^   =34' 


Tabl«  21.  Place  +  between 


and  c. 


line  12  from  bottom.  Place  a  second  asterlck  next  to  3.314. 
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